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Abstract 

We begin a systematic study of how gaugino mass unification can be probed at the CERN 
Large Hadron Collider (LHC) in a quasi-model independent manner. As a first step in that 
direction we focus our attention on the theoretically well-motivated mirage pattern of gaugino 
masses, a one-parameter family of models of which universal (high scale) gaugino masses are 
a limiting case. We improve on previous methods to define an analytic expression for the 
metric on signature space and use it to study one-parameter deviations from universality in 
the gaugino sector, randomizing over other soft supersymmetry-breaking parameters. We put 
forward three ensembles of observables targeted at the physics of the gaugino sector, allowing 
for a determination of this non-universality parameter without reconstructing individual mass 
eigenvalues or the soft supersymmetry-breaking gaugino masses themselves. In this controlled 
environment we find that approximately 80% of the supersymmetric parameter space would give 
rise to a model for which our method will detect non-universality in the gaugino mass sector at 
the 10% level with 0(10 fb~^) of integrated luminosity. We discuss strategies for improving the 
method and for adding more realism in dealing with the actual experimental circumstances of 
the LHC. 



1 Introduction 



As the Large Hadron Collider (LHC) era fast approaches, the theoretical community is increasingly 
focused on how the new discoveries made there will be interpreted. The first step, most obviously, 
is to establish the presence of physics beyond the Standard Model. This will be done using search 
strategies that are by now well-established, though many interesting "what-if" scenarios continue 
to be proposed and investigated [Ij. We continue to believe that supersymmetry (SUSY) is the 
best-motivated extension to the Standard Model for physics at the LHC energy scale. Furthermore, 
if supersymmetry is indeed relevant at the electroweak scale there are many reasons to expect that 
its presence will be established early on in the LHC program [2j. Indeed, even some properties of 
the spectrum, such as the masses and spins of low-lying new states, may be crudely known even 
after relatively little integrated luminosity O SI [5] . In this paper we begin a research program into 
what comes next: how to connect the multiple LHC observations to organizing principles in some 
(high-energy) effective Lagrangian of underlying physics. 

This secondary problem can be further divided into two sub-problems. The first has come to be 
called the "inversion" problem. Briefly stated, the inversion problem is the recognition that even in 
very restrictive model frameworks it is quite likely that more than one set of model parameters will 
give predictions for LHC observations that are in good agreement with the experimental data [6]. 
Much recent work has focused on how to address this issue [3 El El flUl [TT] , and we will borrow 
much of the philosophy and many of the useful techniques from this recent literature. But our focus 
here is on what we might call the second sub-problem: how to turn the ensemble of distinct LHC 
signatures into a determination of certain broad properties of the underlying Lagrangian at low 
energies. Clearly the most direct attack on this second sub-problem is to perform a global fit to the 
parameters of a particular model [121 [13], modulo the degeneracy issue just described above. Not 
surprisingly, therefore, the work we will describe in this paper will make significant use of likelihood 
fits. But our ultimate goal is to fit to certain broad properties of the underlying physics itself - and 
not simply to a particular model of that physics. 

We will refine this rather vague-sounding goal in a moment. But it is helpful to first consider an 
example of what we mean by the phrase "broad properties of the underlying physics." Consider a 
high energy theorist interested in connecting the (supersymmetric) physics at the LHC to physics 
at an even higher energy scale, such as some underlying string theory. What sort of information 
would be of most use to him or her in this pursuit? Would it be a precise measurement of the 
gluino mass, or of the mass splitting in the top squark sector, or some other such measurement? 
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Obtaining such information is (at least in principle) possible at the LHC, but far more valuable 
would be knowledge of the size of the supersymmetric /i-parameter or whether tan (3 is very small. 
Such information is far more difficult to obtain at the LHC [H] but is more correlated with moduli 
stabilization and/or how the /^-parameter is generated in string models [15]. For example, this 
knowledge may tell us whether the /U-parameter is fundamental in the superpotential or generated 
via the Kahler potential as in the Giudice-Masiero mechanism [16j. This, in turn, is far more 
powerful in discriminating between potential string constructions than the gluino mass itself - no 
matter how accurately it is determined. We might refer to the genesis of the ^-parameter as a 
"broad property of the underlying physics." 

If all such key broad properties of the underlying physics were enumerated, it is our view that 
one of the most important such properties would be the question of gaugino mass universality. That 
is, the notion that at the energy scale at which supersymmetry breaking is transmitted to the ob- 
servable sector, the gauginos of the minimal supersymmetric Standard Model (MSSM) all acquired 
soft masses of the same magnitude. This issue is intimately related to another, perhaps equally 
important issue: the wave-function of the lightest supersymmetric particle, typically the lightest 
neutral gaugino. Few properties of the superpartner spectrum have more far-reaching implications 
for low-energy phenomenology, the nature of supersymmetry breaking, and the structure of the un- 
derlying physics Lagrangian [IT]. If the theorist could be told only one "result" from the LHC data 
the answer to the simple question "Is there evidence for gaugino mass universality?" might well be 
it. But these soft parameters are not themselves directly measurable at the LHC [18jll] One might 
consider performing a fit to some particular theory, such as minimal supergravity (mSUGRA), in 
which universal gaugino masses are assumed |21j - or perhaps to certain models with fixed, non- 
universal gaugino mass ratios [221 [23] . But we are not so much interested in whether mSUGRA - 
or any other particular theory for which gaugino mass universality is a feature - is a good fit to the 
data. Rather, we wish to know whether gaugino mass universality is a property of the underlying 
physics independent of all other properties of the model. From this example both the ambitiousness 
and the difficulty inherent in our task is clear. 

We have therefore decided to begin our attack by considering a concrete parametrization of 
non-universalities in soft gaugino masses. Many such frameworks present themselves, but we will 
choose a parametrization that has the virtue of also having a strong theoretical motivation from 

^Even a measurement of the physical gluino mass is not a direct measurement of the associated SU{3) soft mass 
M3. Quantum corrections to the gluino bare mass can be sizable and their theoretical computation involves a large 
set of other MSSM soft parameters [191 120] - which are also not directly measurable! 
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string theory. In recent work by Choi and Nilles [25 soft supersymmetry-breaking gaugino mass 
patterns were explored in a variety of string-motivated contexts. In particular, the so-called "mirage 
pattern" of gaugino masses provides an interesting case study in gaugino mass non-universality. 
Yet as mentioned above, these soft supersymmetry breaking parameters are not themselves directly 
measurable. Linking the soft parameters to the underlying Lagrangian is important, but without 
the crucial step of linking the parameters to the data itself it will be impossible to reconstruct the 
underlying physics from the LHC observations. 

The mirage paradigm gets its name from the fact that should the mirage pattern of gaugino 
masses be used as the low-energy boundary condition of the (one-loop) renormalization group 
equations then there will exist some high energy scale at which all three gaugino masses are identical. 
This unification has nothing to do with grand unification of gauge groups, however, and the gauge 
couplings will in general not unify at this particular energy scale - hence the name "mirage." 
The set of all such low-energy boundary conditions that satisfy the mirage condition defines a 
one-parameter family of models. This parameter can be taken to be the mirage unification scale 
itself, or some other parameter, such as the ratio between various contributions to the gaugino 
soft masses. We note that the minimal supergravity paradigm of soft supersymmetry breaking is 
itself a member of this family of models since it is defined by the property that gaugino masses are 
universal at the scale Mqut — 2 x 10^^ GeV. Indeed, in the parametrization we adopt from |24j . 
the gaugino mass ratios at the electroweak scale take the form 

Ml : Ms : M3 ~ (1 + 0.66a) : (2 + 0.2a) : (6 - 1.8a) , (1.1) 

where the case a = is precisely the unified mSUGRA limit. Note that when we speak of testing 
gaugino mass universality, therefore, we do not imagine a common gaugino soft mass at the low- 
energy scale. Instead, the "universality" paradigm implies the ratios 

Ml : M2 : M3 ~ 1 : 2 : 6. (1.2) 

The goal of this work is to ask whether it is possible to determine that the a parameter of (jl.ip is 
different from zero ~ and if so, how. 

The theoretical details behind the ratios of p.l|) will be the topic of Section [5] in this paper. 
These details are largely irrelevant for the analysis that follows in Sections [3] and HI but may 
nevertheless be of interest to many readers. For those who are only interested in the methodology we 
will pursue and the results, this section can be omitted. At the end of Section [2] we will present two 
benchmark scenarios that arise from concrete realizations of the mirage pattern of gaugino masses 
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in certain classes of string models. As this is a paper about the interface of theory and experiment 
at the LHC - and not about string phenomenology per se - we will leave the theoretical description 
of these models to the Appendix. In Section [3] we discuss how we will go about attempting to 
measure the value of the parameter a in (jl.ip and describe the process that led us to an ensemble 
of specific LHC observables targeted for precisely this purpose. In Section H] this list of signatures 
is tested on a large collection of MSSM models, as well as on our two special benchmarks from 
Section [2l We will see that the signature lists constructed using the method of Section [3] do an 
excellent job of detecting the presence of non-universality in the gaugino soft masses over a very 
wide array of super symmetric spectra hierarchies and mass ranges. Non-universality on the order of 
30-50% should become apparent within the first 10 fb~^ of analyzed data for most super symmetric 
models consistent with current experimental constraints. Detecting non-universality at the 10% 
level would require an increase in data by roughly a factor of two. Nevertheless, depending on 
the details of the superpartner spectrum, some cases will require far more data to truly measure 
the presence of non-universality. Of course all of these statements must here be understood in the 
context of the very particular assumptions of this study. Some thoughts on how the process can be 
taken further in the direction of increased realism are discussed in the concluding section. 

Before moving to the body of the paper, however, we would like to take a moment to emphasize 
a few broad features of the theoretical motivation behind the parametrization in (jl.ip . In the limit 
of very large values for the parameter a the ratios among the gaugino masses approach those of the 
anomaly-mediated supersymmetry breaking (AMSB) paradigm |25ll26j . In fact, the mirage pattern 
is most naturally realized in scenarios in which a common contribution to all gaugino masses is 
balanced against an equally sizable contribution proportional to the beta-function coefficients of 
the three Standard Model gauge groups. Such an outcome arises in string-motivated contexts, 
such as KKLT-type moduli stabilization in D-brane models \ri\ [28] and Kahler stabilization in 
heterotic string models [29]. These string-derived manifestations can also be extended easily to 
include the presence of gauge mediation, in which the mirage pattern is maintained in the gaugino 
sector [301 131]- Importantly, however, it can arise in non-stringy models, such as deflected anomaly 
mediation [321 133] . We note that in none of these cases is the pure- AMSB limit likely to be 
obtained, so our focus here will be on small to moderate values of the parameter a in (in|)i We 
will further refine these observations in Section [2] before turning our attention to the measurement 
of the parameter a at the LHC. 

^In any event, the phenomenology of the AMSB scenario is sufficiently distinct from the models we will consider 
that distinguishing between them should not be difficult [34] , 
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2 Theoretical Motivation and Background 



In this section we wish to understand the origin of the mass ratios in from first principles. We 
win treat the mirage mass pattern here in complete generality, without any reference to its possible 
origin from string-theoretic considerations. This short section concludes with two specific sets of 
soft parameters, both of which represent models with the mirage gaugino mass pattern (though 
the physics behind the rest of their soft supersymmetry breaking parameters are quite different). 
In the Appendix we will recast the discussion of this section in terms of the degrees of freedom 
present in low-energy effective Lagrangians from string model building. There we will also present 
the string theory origin of the two benchmark models that appear in Table [1] at the end of this 
section. 

Let us begin by imagining a situation in which there are two contributions to the soft super- 
symmetry breaking gaugino masses. We assume that these contributions arise at some effective 
high-energy scale at which supersymmetry breaking is transmitted from some hidden sector to the 
observable sector. Let us refer to this scale as simply the ultraviolet scale Auv It is traditional in 
phenomenological treatments to take this scale to be the GUT scale at which gauge couplings unify, 
but in string constructions one might choose a different (possibly higher scale) at which the super- 
gravity approximation for the effective Lagrangian becomes valid. We will further assume that one 
contribution to gaugino masses is universal in nature while the other contribution is proportional 
to the beta-function coefficient of the Standard Model gauge group. More specifically, consider the 
universal piece to be given by 



where a = 1, 2, 3 labels the Standard Model gauge group factors Ga and represents some mass 
scale in the theory. The second piece is the so-called anomaly mediated piece, which arises from 
loop diagrams involving the auxiliary scalar field of supergravity \35\ I36j . It will take the form 



where the ba are the beta-function coefficients for the Standard Model gauge groups. In our 
conventions these are given by 



where Ca, C* are the quadratic Casimir operators for the gauge group Qa, respectively, in the 
adjoint representation and in the representation of the matter fields charged under that groupie 
■^The convention chosen in (|2.5p is opposite of the one chosen in [37] . 




(2.3) 




(2.4) 




(2.5) 
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For the MSSM these are 



Note that if we take A, 
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{^1,^2,^3} -■ 
Aqut then we have 



gl (Auv) = gl (Auv) = gl (Auv) = gl^ 



(2.6) 



(2.7) 



The mass scale Mg is common to ah three gauge groups; the subscript is meant to indicate that 
the contribution in (j2.4p is related to the gravitino mass. The full gaugino mass at the high energy 
boundary condition scale is therefore 



Ma {A, 



Ml (Aw) + Ml (A 



Mu + gl{K^v)-^Mg. 



(2i 



Now imagine evolving the boundary conditions in (12. Sp to some low via the 

(one- loop) renormalization group equations (RGEs). For the anomaly-generated piece of (|2.4p we 
need only replace the gauge coupling with the value at the appropriate scale 

ha 



M'a (Ah 



gt (A: 



-'a 



■Ma 



2 



(2.9) 



while for the universal piece we can use the fact that Ma/ga is a constant for the one-loop RGEs. 
After some manipulation this yields 



Ml (Aew) = Mu 



l-gl (Aew) ^ In 



A, 



A, 



Combining (|2.10p and (|2.9p gives the low scale expression 



Ma (Aew) = M-Al - gl (Aew) 



87r2 



In 



A, 



Ae 



Ma 



M, In 



(2.10) 



(2.11) 



For gaugino masses to be unified at the low scale Aew then the quantity in the square brackets 
in ()2.1ip must be engineered to vanish. This can be achieved with a judicious choice of the values 
Mu and Mg for a particular high-energy input scale Auv Put differently, for a given Auv (such as 
the GUT scale) and a given overall scale there is a one-parameter family of models defined by 
the choice Mg. 

It is possible, however, to find a more convenient parametrization of the family of gaugino mass 
patterns defined by (I2.1ip . Consider defining the parameter a by 



Ma 



a 



Mu In (Auv/A 

EW } 



(2.12) 
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so that ()2.1ip becomes 

Ma (Aew) 



1 - 5a (Aew) ^ In 



A, 



A, 



(2.13) 



and the requirement of universaUty at the scale A^w now imphes a = 2. Normahzing the three 
gaugino masses by Mi (Aew) |a=o and evaluating the gauge couplings at a scale Aew = 1000 GeV 
we obtain the mirage ratios 



Ml : Ma : M3 = (1.0 + 0.66a) : (1.93 + 0.19a) : (5.87 - 1.76a) , 



(2.14) 



for Auv = Aqut, in good agreement with the expression in p.l|) . 

Let us generalize the parametrization in (j2.12p once more. Instead of defining the parameter in 
terms of the starting and stoping points in the RG evolution of the gaugino mass parameters, we 
will fix them in terms of mass scales in the theory itself. Thus we follow the convention of Choi et 
al. [38] and define 

Mr, 



(y= —1 ('2 151 

Mu\n{M,jMg) ' ^ • ^ 

where MpL is the reduced Planck mass MpL = 2.4 x 10^^ GeV. Our parametrization is now divorced 
from the boundary condition scales of the RG flow and can be fixed in advance. The choice of mass 
parameters in the logarithm of (j2.15p may seem arbitrary - and at this point it is indeed completely 
arbitrary - but they have been chosen so as to make better contact with string constructions, such 
as those which we present in the Appendix. Inserting ()2.15p into (j2.1ip yields 



Ma (Aew) 



Mull-gl (Aew) ^ 



Mu{l-gl (A; 




(2.16) 



Comparing this expression with (|2.10p it is clear if gauge couplings unify at a scale Ayv = Aqut, 
then we should expect the soft supersymmetry breaking gaugino masses to unify at an effective 
scale given by 



A. 



A. 



M, 

MpT 



a/2 



(2.17) 



We see that our parametrization in terms of a is indeed equivalent to a parametrization in terms 
of the effective unification scale, as suggested in the introduction. 

The value of a as defined in (|2.12p or (j2.15p can be crudely thought of as the ratio of the anomaly 
contribution to the universal contribution to gaugino masses. Indeed, the limit a ^ is the limit 
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Parameter 


Point A 


Point B 




Parameter 


Point A 


Point B 


a 


0.3 


1.0 






(1507)2 


(430.9)2 




1.5 TeV 


16.3 TeV 






(1504)2 


(610.3)2 


Ml 


198.7 


851.6 




ml^ 


(1505)2 


(352.2)2 


M2 


172.1 


553.3 






(1503)2 


(381.6)2 


Ms 


154.6 


339.1 




m% 


(1502)2 


(407.9)2 


At 


193.0 


1309 




2 


(1508)2 


(208.4)2 


Ab 


205.3 


1084 






(1506)2 


(302.7)2 


Ar 


188.4 


1248 




"^1)1,2 


(1505)2 


(347.0)2 


2 

mir 

tlu 


(1500)2 


(752.0)2 




"^il,2 


(1503)2 


(379.8)2 




(1503)2 


(388.7)2 




"^ll,2 


(1502)2 


(404.5)2 



Table 1: Soft Term Inputs. Initial values of supersymmetry breaking soft terms in GeV at the initial scale given 
by Auv = 2 X 10^'' GeV. Both points are taken to have /i > and tan/3 — 10. The actual value of tan/3 is fixed in 
the electroweak symmetry-breaking conditions. 

of the minimal supergravity paradigm, while a — > oo is the AMSB Umit. But as (|2.8p makes clear, 
these two contributions will be of comparable size only if Mg is at least an order of magnitude 
larger than Mu- We could therefore have chosen a parametrization based on the ratio r = Mg/Mu, 
with interesting values being in the range r ~ 0(10 — 100). But such a parametrization spoils 
the simple relation with the mirage unification scale (j2.17p . Furthermore, the introduction of the 
factor ln(MpL/Mg) in ()2.15p provides the needed large factor, taking a value of ln{Mpi^/Mg) ~ 35 
for Alg ~ 1 TeV. To obtain the mirage pattern it is therefore necessary for the underlying theory 
to generate some large number c ~ ln{Mpi^/Mg) ~ 30. Specific examples of how this is achieved in 
explicit string-based models are given in the Appendix to this paper. 

In Table [T] we have collected the necessary soft supersymmetry-breaking parameters to com- 
pletely specify two benchmark points for further analysis in what follows. The details behind these 
two models are described in the Appendix. Here we will simply indicate that point A represents a 
heterotic string model with Kahler stabilization of the dilaton which was studied in detail in |37j . 
This particular example has a value of a = 0.3. Point B is an example from a class of Type IIB string 
compactifications with fluxes which was studied in [38]. This second example has a value a = 1.0. 
Both are examples of the mirage pattern of gaugino masses, having mirage unification scales of 
Amir = 2.0 X 10^^ GeV and Amir = 1-5 x 10^ GeV, respectively. Note that these soft supersymmetry 
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breaking terms are taken to be specified at the GUT energy scale of Agut = 2.0 x 10^^ GeV and 
must be evolved to electroweak scale energies through the renormalization group equations. 

3 Determining a: Methodology 
3.1 Setting Up the Problem 

As mentioned in the introduction, the ultimate goal of this avenue of study is to determine whether 
or not soft supersymmetry breaking gaugino masses obey some sort of universality condition in- 
dependent of all other facts about the supersymmetric model. Such a goal cannot be met in a 
single paper so we have begun by asking a simpler question: assuming the world is defined by 
the MSSM with gaugino masses obeying the relation (|l.ip . how well can we determine the value 
of the parameter a. At the very least we would like to be able to establish that a ^ with a 
relatively small amount of integrated luminosity. The first step in such an incremental approach is 
to demonstrate that some set of "targeted observables" [12] (we will call them "signatures" in what 
follows) is sensitive to small changes in the value of the parameter a in a world where all other 
parameters which define the SUSY model are kept fixed. In subsequent work we intend to relax this 
strong constraint and treat the issue of gaugino mass universality more generally. Despite the lack 
of realism we feel this is a logical point of departure - very much in the spirit of the "slopes" of the 
Snowmass Points and Slopes [39j and other such benchmark studies. Thus, where the Snowmass 
benchmarks talk of slopes, we will here speak of "model lines" in which all parameters are kept 
fixed but the value of a is varied in a controlled manner. 

To construct a model line we must specify the supersymmetric model in all aspects other than 
the gaugino sector. The MSSM is completely specified by 105 distinct parameters, but only a small 
subset are in any way relevant for the determination of LHC collider observables }14j . We will 
therefore choose a simplified set of 17 parameters as in the two benchmark models of Table [T] 



tan/3, mjj^, mj^^ 

M3, At, Ab, Ar 
"iQl,2' ^Ul,2, "lDi,2, '^Ll,2' "^Sl,2 

mq^, mu3, ruD^, rriLs, 



(3.18) 



The parameters in (jS.lSp are understood to be taken at the electroweak scale (specifically Aew = 
1000 GeV) so no renormalization group evolution is required. The gluino soft mass M3 will set 
the overall scale for the gaugino mass sector. The other two gaugino masses Mi and M2 are then 
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determined relative to M3 via ()2.14p . A model line will take the inputs of (|3.18p and then construct 
a family of theories by varying the parameter a from a = (the mSUGRA limit) to some non-zero 
value in even increments. 

For each point along the model line we pass the model parameters to PYTHIA 6 . 4 [IQ] for 
spectrum calculation and event generation. Events are then sent to the PGS4 [IT] package to 
simulate the detector response. Additional details of the analysis will be presented in later sections. 
The end result of our procedure is a set of observable quantities that have been designed and (at 
least crudely) optimized so as to be effective at separating a = from other points along the model 
line in the least amount of integrated luminosity possible. In Section 13.21 we describe the manner 
in which we perform this separation between models. The signature lists, and the analysis behind 
their construction, is presented in Section [3. 3[ In Section H] we will demonstrate the effectiveness of 
these signature lists on a large sample of randomly generated model lines and provide some deeper 
insight on why the whole procedure works by examining our benchmarks in greater detail. 

3.2 Distinguishability 

The technique we will employ to distinguish between candidate theories using LHC observables 
was suggested in [12] and subsequently refined in [6j. The basic premise is to construct a variable 



where S is some observable quantity (or signature). The index i = 1, . . . ,n labels these signatures, 
with n being the total number of signatures considered. The labels A and B indicate two distinct 
theories which give rise to the signature sets and , respectively. Finally, the error term 6Sf^ 
is an appropriately-constructed measure of the uncertainty of the term in the numerator, i.e. the 
difference between the signatures. In this work we will always define a signature S as an observation 
interpreted as a count (or number) and denote it with capital N. One example is the number of 
same-sign, same-flavor lepton pairs in a certain amount of integrated luminosity. Another example 
is taking the invariant mass of all such pairs and forming a histogram of the results, then integrating 
from some minimum value to some maximum value to obtain a number. In principle there can be 
an infinite number of signatures defined in this manner. In practice experimentalists will consider 
a finite number and many such signatures are redundant. 



similar to a traditional chi-square statistic 




(3.19) 
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We can identify any signature Ni with an effective cross section crj via the relation 



N,/L, 



(3.20) 



where L is the integrated luminosity. We refer to this as an effective cross-section as it is defined 
by the counting signature Ni which contains in its definition such things as the geometric cuts that 
are performed on the data, the detector efficiencies, and so forth. Furthermore these effective cross 
sections, whether inferred from actual data or simulated data, are subject to statistical fluctuations. 
As we increase the integrated luminosity we expect that this effective cross section ai (as inferred 
from the data) converges to an "exact" cross section Cj given by 



lim fjj 



(3.21) 



These exact cross sections are (at least in principle) calculable predictions of a particular theory, 
making them the more natural quantities to use when trying to distinguish between theories. 
The transformation in (|3.2U|) allows for a comparison of two signatures with differing amounts of 
integrated luminosity. This will prove useful in cases where the experimental data is presented 
after a limited amount of integrated luminosity La, but the simulation being compared to the 
data involves a much higher integrated luminosity Lb- Using these notions we can re-express our 
chi-square variable {IS.Sab)'^ in terms of the cross sections 



(A5. 



AS 



1 



n 



E 



(3.22) 



We will assume that the errors associated with the signatures Ni are purely statistical in nature 
and that the integrated luminosities La and Lb are precisely known, so that 



^-af' = ^[^-at f + (5af )2 = ^-atlLA + of jLn , 



(3.23) 



and therefore (AS'^s)^ is given by 



(AS 



AB) 



n 



E 



- -of 



at/LA + af/Ls 



(3.24) 



where each cross section includes the (common) Standard Model background, i.e. ai = 

The variable {/^Sab)"^ forms a measure of the distance between any two theories in the space 
of signatures defined by the Sj. We can use this metric on signature space to answer the following 
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question: how far apart should two sets of signatures Sf' and be before we conclude that theories 
A and B are truly distinct? The original criterion used in [6j was as follows. Imagine taking any 
supersymmetric theory and performing a collider simulation. Now choose a new random number 
seed and repeat the simulation. Due to random fluctuations we expect that even the same set 
of input parameters, after simulation and event reconstruction, will produce a slightly different 
set of signatures. That is, we expect {/S.Saa)'^ / since it involves the effective cross-sections 
as extracted from the simulated data. Now repeat the simulation a large number of times, each 
with a different random number seed. Use (|3.24p to compute the distance of each new simulation 
with the original simulation in signature space. The set of all {/S.Saa)'^ values so constructed will 
form a distribution. Find the value of (A5'AA)^|g5 ^^^^ distribution which represents the 95th 
percentile of the distribution. This might be taken as a measure of the uncertainty in "distance" 
measurements associated with statistical fluctuations. 

This procedure for defining distinguishability is unwieldy in a number of respects. Determining 
the threshold for separating models by {ASabY > {^Saa)'^\^^ is computationally intensive as it 
requires many repeated simulations of the same model (as well as the Standard Model background). 
More importantly, the "brute force" determination of (AS'^^j^Igg is particular to model A as well 
as the list of signatures used in (j3.24p . Each change in either the model parameters or the signature 
mix demands a new determination of the threshold for distinguishability. We will therefore propose 
a new criterion that has the benefit of being analytically calculable with a form that is universal 
to any pair of models and any set of signatures. 

To do that let us reconsider the quantum fluctuations. At a finite integrated luminosity L we 
can describe the outcome of a counting experiment as a Poisson distribution approximated by a 
normal distribution (this is a good approximation for approximately 10 counts or more), which can 
be expressed as 



Here Z is a standard random variable, i.e. a random variable having a normal distribution centered 
at with a standard deviation of 1. Note that by introducing statistical fluctuations via the variable 
Z we can replace fij in (|3.25p with the exact cross section. Equation (j3.25p then merely states the 
well known fact that the distribution in measured values Ni should form a normal distribution 
about the value Lcrj. To combine two such distributions A'^i and we may write 




(3.25) 




(3.26) 
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where Z is a new standard random variable and a"^ is the total cross-section. For example, ai 
might be the contribution to a particular final state arising from Standard Model processes while 
(72 might be the contribution arising from production of super symmetric particles. 

With the above in mind we can re-visit the definition (j3.24p and obtain an analytic approxima- 
tion for the distribution in [ASab)'^ values by using random variables to represent the signatures. 
The measured cross sections can be related to the exact cross sections via 



af = Nf/LA = erf + ^u^/La Za , 
with a similar expression for the model B. Substituting (j3.27p into (j3.24p gives 



(3.27) 



{^Sab? 



- of 



-\ 2 



La ^ Le 



A B 



n 



E 



+ Z 



^ i_ 

La Lb 



(3.28) 



where we have combined Za and Zb into the random variables Z and Z' and have assumed that 
La and Lb are sufficiently large to be able to neglect the term proportional to Z'. In this limit we 
immediately see that (ASab)'^ is itself a random variable with a probability distribution for the 
quantity (ASab)'^ given by 

P{AS^)=nxlxinAS^), (3-29) 

where Xn x non-central chi-squared distribution for n degrees of freedom^ The non-centrality 

parameter A is given by 



A = E' 



CJ. 



B\2 



(3.30) 



af/LA+af/LB' 

and now the ai represent exact cross sections. This is actually the result we expect since the 
original (ASab)'^ in (|3.24p is essentially a chi-square like function. Note that since the ai in the 
distribution of (I3.28P are exact, we have the anticipated result that fiuctuations of the quantity 
{ASaa)'^ should be given by the central chi-square distribution Xn(0)- We note, however, that the 



^If we had chosen to define the separation variable (I3.19|) without the factor of l/n we would have found that the 
distribution of (ASab)^ values was exactly given by the non-central chi-square distribution. The two are related by 
a simple change of variables. 
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Figure 1: Plot of distribution in (ASaa)'^ values. The top panel plots the probability distribution function p.29[) 
for A = and n = 1, 3, 5 and 10. The lower panel plots the cumulative distribution function - the absolute 
probability for obtaining that value of (AS)^. The 95% percent threshold is indicated by the horizontal lines, and 
the corresponding values of (AS')^ are indicated by the marked values of 7n(0.95). 



derivation of (|3.28|) implicitly assumed that the signatures Si which we consider are uncorrelated 
- or more precisely that the fluctuations in these signatures are uncorrelated. We will have more 
to say about signature correlations in Section [3.31 below. We now have a measure of separation in 
signature space that is related to well known functions in probability theorylf] 

Armed with this technology, let us return to the issue of distinguishing a model from itself. 
From (j3.28p . (j3.29p and (j3.30p it is apparent that all the physics behind the distribution of possible 
{ASab)'^ values is contained in the values of A and n. In particular the distribution of possible 
(AS'aa)^ values (a central chi-square distribution) should depend only on the number n of signatures 
considered - not on the model point nor on the nature of those signatures. When comparing a 

'''In fact, the non-central chi-square distribution is related to the regularized confluent geometric functions . 
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model with itself we can therefore dispense with the subscript and write {ASaa)'^ = (AS*)^. We 
plot the probability distribution P{ASf of ([3:29]) for A = and various values of n in the top panel 
of Figure [TJ We have also plotted the cumulative distribution function for the same n values in the 
lower panel of Figure [TJ To rule out the null hypothesis {i.e. the hypothesis that models A and B 
are in fact the same model) to a level of confidence p requires demanding that {AS)"^ is larger than 
the p-th. percentile value for the distribution (|3.29|) for the appropriate n value. For example, if 
we use the criterion from [6] and require (ASab)'^ > {^S)'^\g5th t^^^n p = 0.95. We have indicated 
this value for the cumulative distribution function by the horizontal dashed line in Figure [TJ In 
general we will denote this particular value of {AS)'^\p for each value of n by the symbol 7n(p)- It 
can be found via the cumulative distribution function as in Figure [TJ or by numerically solving the 
equation 

r(f,f7„w)=r(f)(i-rt, (3.31) 

where T{n) is Euler's gamma function and T(n,m) is the incomplete gamma function. A summary 
of these values for smaller n values is given in Table [2j If we measure our n signatures, extract the 
cross-sections, form (ASab)'^ and the number is greater than ^n{p) then we can say that the null 
hypothesis can be ruled out at a level of confidence given by p x 100%. The value of this critical 
(A5)^|p = 7n(p) is a universal number determined only by our choice of p value and the number of 
signatures n that we choose to consider. 

If, however, our measurement gives (ASab)'^ < Inip) then we cannot say the two models are 
distinct, at least not at the confidence level p. But they may still be separate models and we 
were simply unfortunate, with statistical fluctuations producing a small value of {ASab)"^- If we 
accumulate more data and measure {ASab)"^ again, we may find a different result. To quantify 
the probability that two different models A and B will give a particular value of {ASab)"^ requires 
the use of the non-central chi-square distribution in (|3.29p . The degree of non-centrality is given 
by the quantity A in (j3.30p . Clearly, the more distinct the predictions and af are from one 
another, the larger this number will be. In Figure [2jwe plot the distribution for [ASab)"^ for n = 3 
signatures and several values of A. As expected, the larger this parameter is, the more likely we 
are to find large values of {ASabY- 

Let us assume for the moment that "model A" is the experimental data, which corresponds to an 
integrated luminosity of Our "model i?" can then be a simulation with integrated luminosity 

^sim _ qi^cxp might imagine that q can be arbitrarily large, limited only by computational 
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Confidence Level p 


n 


0.95 


0.975 


0.99 


0.999 


1 


3.84 


5.02 


6.64 


10.83 


2 


3.00 


3.69 


4.61 


6.91 


3 


2.61 


3.12 


3.78 


5.42 


4 


2.37 


2.79 


3.32 


4.62 


5 


2.21 


2.57 


3.02 


4.10 


6 


2.10 


2.41 


2.80 


3.74 


7 


2.01 


2.29 


2.64 


3.48 


8 


1.94 


2.19 


2.51 


3.27 


9 


1.88 


2.11 


2.41 


3.10 


10 


1.83 


2.05 


2.32 


2.96 



Table 2: List of 7n(p) values for various values of the parameters n and p. The value 7n(p) represents the 
position of the p-th percentile in the distribution of P{AS)'^ for any list of n signatures. For example, if we consider 
a list of 10 signatures, then the quantity (ASab)^ formed by these ten measurements must be larger than 1.83 to say 
that models A and B are distinct, with 95% confidence. If we demand 99% confidence this threshold becomes 2.32. 



resources §1 We can then rewrite (j3.30p as 



X = L-^Y. ';a.iJb ■ (3-32) 

From this expression it is clear that we can expect the value of this parameter A to increase 
as experimental data is collected. The larger the value of L'^^p the less likely it becomes to find a 
particularly small value of {/S.SabY'- This confirms out basic intuition that given any observable (or 
set of observables) for which the two models predict different values then with sufficient integrated 
luminosity it should always be possible to distinguish the models to arbitrary degree of confidence. 

For any given value of A 7^ 0, the probability that a measurement of {I^Sab)'^ will fluctuate to a 
value so small that it is not possible to separate two distinct models (to confidence level p) is simply 
the fraction of the probability distribution in ()3.29p that lies to the left of the value ^n{p)- If we 
wish to be at least 95% certain that our measurements will correctly recognize that two different 



® Among other benefits of a large value for q would be the reduction in uncertainties arising from the simulation side 
of the comparison, i. e. assuming that the simulation perfectly captures both the physics and the detector response, 
the remaining uncertainty would be that associated with the experimental observation associated with . 
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Figure 2: Plot of distribution in (ASab)^ values for n — 3 and various A. The probability distribution 
function (|3.29|) for A = 0, 5, 15 and 35 is plotted for the case of n = 3. The curves are normalized such that the total 
area under each distribution remains unity. Note that the peak in the distribution moves to larger values of (ASab)^ 
as the non-centrality parameter A is increased. 

models are indeed distinct we must require 

P = / nxlxinASls) d{AS\j,) = / xl,x{y) dy > 0.95 . (3.33) 

Since the value of the integral P in (j3.33p decreases monotonically as A increases the value of this 
parameter which makes (|3.33|) an equality is the minimum non-centrality value Aniin('i-)P) such that 
the two models can be distinguished. 

In other words for two distinct models A and B, any combination of n experimental signatures 
such that A > Aniin(^iP = 0.95) will be effective in demonstrating that the two models are indeed 
different 95% of the time, with a confidence level of 95%. We have successfully reduced the problem 
to an exercise in pure mathematics, as these Aniin('^)P) values can be calculated analytically without 
regard to the physics involved. A collection of values for small values of n are given in Table [3j 
Note that as we increase n the necessary value Amin increases, reflecting the fact that as more 
observations are made we should expect that it will become increasingly likely that at least one 
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Confidence Level 


P 


n 


0.95 


0.975 


0.99 


0.999 


1 


12.99 


17.65 


24.03 


40.71 


2 


15.44 


20.55 


27.41 


44.99 


3 


17.17 


22.60 


29.83 


48.10 


4 


18.57 


24.27 


31.79 


50.66 


5 


19.78 


25.71 


33.50 


52.88 


6 


20.86 


26.99 


35.02 


54.88 


7 


21.84 


28.16 


36.41 


56.71 


8 


22.74 


29.25 


37.69 


58.40 


9 


23.59 


30.26 


38.89 


59.99 


10 


24.39 


31.21 


40.02 


61.48 



Table 3: List of Amin(n,p) values for various values of the parameters n and p. A distribution sucii as tiiose 
in Figure [2] witii A = Amin(n,p) will have precisely the fraction p of its total area at larger values of (ASab)'^ than 
the corresponding critical value 7n(p) from Tabled A graphical example of this statement is shown in Figure |3l 



will show a large deviation. Indeed, the quantity A can be thought of as a measure of the overall 
distance from (ASab)'^ = in the n-dimensional signature space in units of the variances. As an 
example, again consider the case where n = 3. For this value of n the corresponding 73(0.95) = 2.61 
value can be found from Table [SJ while we can find Ainin(3, 0.95) = 17.17 from Table O We plot the 
distributions (|3.29p for {n. A} = {3,0} and {3,17.17} simultaneously in Figure [3l By construction, 
the area of the non-central distribution to the left of the indicated value of {{ASab)'^) = 2.61 will 
be precisely 5% of the total area. 

Having reached the end of our somewhat lengthy digression on probability theory we now return 
to the physics issue at hand. The requirement that A > Xuiin{n-,p) can be translated into a condition 
on the signature set and/or luminosity via the definition in (j3.32p . Let us make one final notational 
definition 

(^A _ B\2 

Rab = E(^ab). = E A.iB (3-34) 

where Rab has the units of a cross section. Our condition for 95% certainty that we will be able 
to separate two truly distinct models at the 95% confidence level becomes 



Amin(M^-95) 

Rab 



iexp > • (3.35) 
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Figure 3: Determination of Amin for the case n = 3. The plot shows an example of the distribution of (ASab)^ 
for n = 3. The curve on the left represent A = case, i.e. values we will get when we compare a model to itself. 95% 
of the possible outcomes of this comparison are below 2.61 which is shown on the plot. The curve on the right has 
A — 17.17 and 95% of the curve is beyond 2.61. As A increases, this curve moves further to the right and gets flatter. 



Given two models A and B and a selection of n signatures both Xmin{n,0.95) and Rab are com- 
pletely determined. Therefore the minimum amount of integrated luminosity needed to separate 
the models experimentally will be given by 

LminiP) = 5 • (3.36) 



We will be using (13.36P repeatedly throughout the rest of this paper. A well-chosen set of signatures 
will be the set that makes the resulting value of L^[^ determined from (|3.36p as small as it can 
possibly be. 

3.3 Specific Signature Choice 

Following the discussion in Section 13.21 we are in a position to define the goal behind our signature 
selection more precisely. We wish to select a set of n signatures Si such that the quantity L^ninip) 
as defined in (j3.36p . for a given value of p, is as small as it can possibly be over the widest possible 
array of model pairs A and B. We must also do our best to ensure that the n signatures we choose 
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to consider are reasonably uncorrelated with one another so that the statistical treatment of the 
preceding section is applicable. We will address the latter issue below, but let us first turn our 
attention to the matter of optimizing the signature list. 

We took as our starting point an extremely large initial set of possible signatures. These 
included all the counting signatures and most of the kinematic distributions used in [6] , all of the 
signatures of ^42j, several "classic" observables common in the literature [13] and several more 
which we constructed ourselves. Removing redundant instances of the same signature this yielded 
46 independent counting signatures and 82 kinematic distributions represented by histograms, for 
128 signatures in total. We might naively think that the best strategy is to include all of these 
signatures in the analysis (neglecting for now the issue of possible correlations among them). In 
fact, if the goal is statistically separating two models, the optimal strategy is generally to choose a 
rather small subset of the total signatures. Let us understand why that is the case. To do so we 
need a quantitative way of establish an absolute measure of the "power" of any given signature to 
separate two models A and B. This can be provided by considering the condition in (|3.36|) . For 
any signature Si we can define an individual {Ljain)i by 



where, for example, Aniin(l) 0.95) = 12.99. This quantity is exactly the integrated luminosity 
required to separate models A and B, to confidence level p, by using the single observable Si. For a 
list of N signatures it is possible to construct such (-Lmin)i values and order them from smallest 
value (most powerful) to largest value (least powerful). If we take any subset n of these, then the 
requisite Lmin that results from considering all n simultaneously is given by 



Referring back to Table [3] we see that the ratio Amin(n,p)/Amin(l,p) grows with n. This indicates 
that as we add signatures with ever diminishing (Lmin)j values we will eventually encounter a point 
of negative returns, where the resulting overall Lmin starts to grow again. 

As more signatures are added, the threshold for adding the next signature in the list gets steadily 
stronger. For a particular pair of models, A and B, it is always possible to find the optimal list of 
signatures from among a given grand set by ordering the resulting {L^[^)i values and adding them 
sequentially until a minimum of Lmin is observed. To do so, we note that kinematic distributions 
must be converted into counts (and all counts are then converted into effective cross sections). This 




(3.37) 




(3.38) 
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oL . . . . , . . . . , . . . . , . . . . ^ 

10 20 30 40 

Number of Signatures (n) 

Figure 4: An example of finding an "optimal" signature list. By sequentially ordering the calculated (I/min)i 
values for any particular pair of models in ascending order, it is always possible to find the optimal set of signatures 
for that pair by applying p.38p . In this particular example the minimum value of Lmin is found after combining just 
the first 12 signatures. After just the best six signatures we are already within 20% of the optimal value, as indicated 
by the shaded band. 

conversion requires specifying an integration range for each histogram. The choice of this range 
can itself be optimized, by considering each integration range as a separate signature and choosing 
the values such that (ivmin)i is minimized. 

Figure m based on an actual pair of models from one of our model lines, represents the outcome 
of just such an optimization procedure. In this case a clearly optimal signature set is given by 
the 12 signatures represented by the circled point, which yields Lmin = 2.4 fb^^. The situation in 
Figure H] is typical of the many examples we studied: the optimal signature set usually consisted 
of 0(10) signatures. If we are willing to settle for a luminosity just 20% higher than this minimal 
value then we need only 0(5) signatures, typically^ This 20% range is indicated by the shaded 
band in Figure |H Of course this "optimal" set of signatures {Si} is only optimal for the specific 
pair of models A and B. We must repeat this optimization procedure on a large collection of model 

^It is interesting to compare this to the results of [6] in which the effective dimension of signature space was found 
to be also 0(5) to 0(10). 
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pairs and form a suitable average of the results in order to find a set of signatures {Si} that best 
approximates the truly optimal set over the widest possible set of model pairs {A, B}. The lists we 
will present at the end of this section represent the results of just such a procedure. 

But before we present them, we must now address the issue of correlations. To be able to use 
the analytic results of our statistical presentation of the problem in Section 13.21 we must be careful 
to only choose signatures from a list in which all the members are uncorrelated with one another. 
This immediately suggests a dilemma: once a signature is chosen, many others in the grand set will 
now be excluded for being correlated with the first. This complicates the process of optimization 
considerably - the task now becomes to perform the above optimization procedure over the largest 
possible list of uncorrelated (or at least minimally correlated) signatures. To find the correlation 
between any two signatures Si and Sj it is sufficient to construct their correlation coefficient pij, 
given by 

AT 



COy[i,3 .. N L ' "J L ^ -J fn nn\ 



var(i)var(j) A^-.oo /i ^i^i^ 



where the represent the individual results obtained from each of the cross section measure- 
ments, labeled by the index k. 

In our analysis we estimated the entries in the 128 x 128 dimensional matrix of (j3.39p in the 
following crude manner. We began with a simple MSSM model specified by a parameter set as 
in (j3.18p . with gaugino masses having the unified ratios of ()1.2I) . We simulated this model = 
2000 times, each time with a different random number seed. The simulation involved generating 
5 fb~^ of events using PYTHIA 6.4, which were passed to the detector simulator PGS4. After 
simulating the detector response and object reconstruction the default level-one triggers included 
in the PGS4 detector simulation were applied. Further object-level cuts were then performed, as 
summarized in Tabled! After these object-specific cuts we then applied an event-level cut on the 
surviving detector objects similar to those used in [6]. Specifically we required all events to have 
missing transverse energy > 150 GeV, transverse sphericity St > 0.1, and > 600 GeV 
(400 GeV for events with 2 or more leptons) where Ht =J^t + Z^jetsPr*- Oi^ce all cuts were 
applied the grand list of 128 signatures was then computed for each run, and from these signatures 
the covariance matrix in (j3.39p was constructed. All histograms and counting signatures were 
constructed and analyzing using the ROOT-based analysis package Parvicursor [44| . 

Not surprisingly, many of the signatures considered in our grand list of 128 observables were 
highly correlated with one another. For example, the distribution of transverse momenta for the 
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Object 


Minimum pT 


Minimum \r]\ 


Photon 


20 GeV 


2.0 


Electron 


20 GeV 


2.0 


Muon 


20 GeV 


2.0 


Tau 


20 GeV 


2.4 


Jet 


50 GeV 


3.0 



Table 4: Initial cuts to keep an object in the event record. After event reconstruction using tlie package 
PGS4 we apply additional cuts to the individual objects in the event record. Detector objects that fail to meet the 
above criteria are removed from the event record and do not enter our signature analysis. These cuts are applied to 
all analysis described in this paper. 



hardest jet in any event was correlated with the overall effective mass of the jets in the events 
(defined as the scalar sum of all jet pT values: Mgfj = J2jetsPT^)- Both were correlated with the 
distribution of Hj- values for the events, and so forth. The consistency of our approach would then 
require that only a subset of these signatures can be included. One way to eliminate correlations 
is to partition the experimental data into mutually-exclusive subsets through some topological 
criteria such as the number of jets and/or leptons. For example, the distribution of Ht values in 
the set having any number of jets and zero leptons will be uncorrelated with the same signature 
in the set having any number of jets and at least one lepton. Our analysis indicated that this 
partitioning strategy has its limitations, however. The resolving power of any given signature tends 
to diminish as the set it is applied to is made ever more exclusive. This is in part due to the 
diminishing cross-section associated with the more exclusive final state (recall that our metric for 
evaluating signatures is proportional to the cross-section). It is also the case that the statistical 
error associated with extracting these cross-section values from the counts will grow as the number 
of events drops. We were thus led to consider a very simple two- fold partitioning of the data: 

A^jets < 4 versus A'jcts > 5, 

iVleptons = versus Meptons > 1- 

This choice of data partitioning is reflected in the signature tables at the end of this section. 

Within each if the four subsets it is still necessary to perform a correlation analysis and construct 
the matrix in (j3.39p . Let us for the moment imagine that we are willing to tolerate a correlation 
among signatures given by some value e. Then the matrix of correlations in (j3.39p can be converted 
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Description 


Min Value 


Max Value 


1 


= + EaiiPf [All events] 


1250 GeV 


End 



Table 5: Signature List A. The effective mass formed from tlie transverse momenta of all objects in the event 
(including the missing transverse energy) was the single most effective signature of the 128 signatures we investigated. 
Since this "list" is a single item it was not necessary to partition the data in any way. For this distribution we integrate 
from the minimum value of 1250 GeV to the end of the distribution. 

into a matrix Cab which defines the uncorrelated signatures by assigning the values 

= (3.41) 

[0 it pab> e. 

The matrix Cab is actually the adjacency matrix of a graplj§ and the problem of finding all the 
possible sets of uncorrelated signatures is equivalent to finding all the complete subgraphs (or 
'clique') of that graph. A complete graph is a graph which has an edge between each vertex. In 
terms of our problem, this means a set of signatures having at most a correlation at the level 
of e between any two of them. This is a well-known problem in combinatorics that becomes 
exponentially more difficult to solve as the number of signatures increases. For our purposes we 
will be working with relatively small sets of signatures which were pre-selected on the basis of their 
effectiveness for separating a = from non-zero values of this parameter. Then from these sets we 
will proceed to build the maximal subgraph for our choice of allowed correlation e. 

We constructed a large number of model families in the manner described in Section [3. H each 
involving the range —0.5 < a < 1.0 for the parameter a in steps of Aa = 0.05. For each point 
along these model lines we generated 100,000 events using PYTHIA 6.4 and PGS4. To this we added 
an appropriately- weighted Standard Model background sample consisting of 5 fb~^ each of t/t 
and b/b pair production, high-p-r QCD dijet production, single and Z-boson production, pair 
production of electroweak gauge bosons W^Z and Z Z), and Drell-Yan processes. To 

examine which of our 128 signatures would be effective in measuring the value of the parameter a 
we fixed "model A" to be the point on each of the model lines with a = and then treated each 
point along the line with a 7^ as a candidate "model B." Clearly each model line we investigated 
- and each a value along that line - gave slightly different sets of maximally effective signatures. 
The lists we will present in Tables [5l [6] and [7| represent an ensemble average over these model lines, 

*A graph is a set of vertices connected by edges. An element of an adjacency matrix of a graph is 1 if there is an 
edge between two vertices, otherwise. 
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Description 


Min Value 


Max Value 


1 


M^g" [0 leptons, > 5 jets] 


1100 GeV 


End 


2 


M^g^ [0 leptons, < 4 jets] 


1450 GeV 


End 


3 


M^g^ [> 1 leptons, < 4 jets] 


1550 GeV 


End 


4 


Pt (Hardest Lepton) [> 1 lepton, > 5 jets] 


150 GeV 


End 


5 


M-^l'' [0 leptons, < 4 jets] 


GeV 


850 GeV 



Table 6: Signature List B. The collection of our most effective observables, restricted to the case where the 
maximum correlation between any two of these signatures is 10%. Note that the jet-based effective mass variables 
would normally be highly-correlated if we had not partitioned the data according to H3.40p . For these distributions 
we integrate from "Min Value" to "Meix Value" . 

restricted to a maximum correlation amount e as described above. 

Let us begin with Table [5l which gives the single most effective signature at separating models 
with different values of the parameter a. It is the effective mass formed from all objects in the 
event 

M:^^=^T + Y.Pf^ (3-42) 

all 

where we form the distribution from all events which pass our initial cuts. That this one signature 
would be the most powerful is not a surprise given the way we have set up the problem. It is the 
most inclusive possible signature one can imagine (apart from the overall event rate itself) and 
therefore has the largest overall cross-section. Furthermore, the variable in (j3.42p is sensitive to 
the mass differences between the gluino mass and the lighter electroweak gauginos - precisely the 
quantity that is governed by the parameter a. Yet as we will see in Section [J] this one signature can 
often fail to be effective at all in certain circumstances, resulting in a rather large required Lmin 
to be able to separate a = from non- vanishing cases. In addition it is built from precisely the 
detector objects that suffer the most from experimental uncertainty. This suggests a larger and 
more varied set of signatures would be preferable. 

We next consider the five signatures in Table [H These signatures were chosen by taking our 
most effective observables and restricting ourselves to that set for which e = 10%. We again see 
the totally inclusive effective mass variable of (|3.42|) as well as the more traditional effective mass 
variable, M^'^^, defined via (j3.42p but with the scalar sum of pT values now running over the jets 
only. We now include the pT of the hardest lepton in events with at least one lepton and five or 
more jets, as well as the invariant mass M^^^ of the jets in events with zero leptons and 4 or less 
jets. The various jet-based effective mass variables would normally be highly correlated with one 
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Description 


Min Value 


Max Value 



Counting Signatures 



Ni [> 1 leptons, < 4 jets] 
Ni+;,- [mC^~ =Mz±5 GeV] 
Nb [> 2 B-jets] 



[0 leptons, < 4 jets] 



4 


err 


1000 GeV 


End 


5 


inv 


750 GeV 


End 


6 




500 GeV 


End 


[0 leptons, > 5 jets] 


7 


cn 


1250 GeV 


3500 GeV 


8 


rjct [3 jets > 200 GeV] 


0.25 


1.0 


9 


PT(4th Hardest Jet) 


125 GeV 


End 


10 




0.0 


0.25 


[> 1 leptons, > 5 jets] 


11 




0.0 


0.25 


12 


Pt (Hardest Lepton) 


150 GeV 


End 


13 


PT(4th Hardest Jet) 


125 GeV 


End 


14 




1250 GeV 


End 



Table 7: Signature List C. In this collection of signatures we have allowed the maximum correlation between 
any two signatures to be as high as 30%. Note that some of the signatures are normalized signatures, (#8, #10 and 
#11), while the first three are truly counting signatures. A description of each of these observables is given in the 
text. For all distributions we integrate from "Min Value" to "Max Value". 



another if we were not forming them from disjoint partitions of the overall data set. The favoring 
of jet-based observables to those based on leptons is again largely due to the fact that jet-based 
signatures will have larger effective cross-sections for reasonable values of the SUSY parameters 
in ()3.18p than leptonic signatures. The best signatures are those which track the narrowing gap 
between the gluino mass and the electroweak gauginos and the narrowing gap between the lightest 
chargino/second- lightest neutralino mass and the LSP mass. In this case the first leptonic signature 
to appear - the transverse momentum of the leading lepton in events with at least one lepton - is 
an example of just such a signature. 

Finally, let us consider the larger ensemble of signatures in Table [71 In this final set we have 
relaxed our concern over the issue of correlated signatures, allowing as much as 30% correlation 
between any two signatures in the list. This allows for a larger number as well as a wider variety of 
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observables to be included. As we will see in Section U] this can be very important in some cases in 
which the supersymmetric model has unusual properties, or in cases where the two a values being 
considered give rise to different mass orderings (or hierarchies) in the superpartner spectrum. In 
displaying the signatures in Table [7| we find it convenient to group them according to the partition 
of the data being considered. Note that the counting signatures are taken over the entire data set. 

The first counting signature is simply the total size of the partition from (|3.4U|) in which the 
events have at least one lepton and 4 or less jets. This was the only observable taken on this data 
set that made our list of the most effective observables. The next two signatures are related to 
"spoiler" modes for the trilepton signal. Note that the trilepton signal itself did not make the list: 
this is a wonderful discovery mode for supersymmetry, but the event rates between a model with 
a = and one with non- vanishing a were always very similar (and low). This made the trilepton 
counting signature ineffective at distinguishing between models. By contrast, counting the number 
of b-jet pairs (a proxy for counting on-shell Higgs bosons) or the number of opposite-sign electron 
or muon pairs whose invariant mass was within 5 GeV of the Z-mass (a proxy for counting on-shell 
Z-bosons) were excellent signatures for separating models from time to time. This was especially 
true when the two models in question had very different values of a such that the mass differences 
between N2 and A'^i were quite different in the two cases. We will give specific examples of such 
outcomes in Section HI 

The following three sections of Table [7] involve some of the same types of observables as in the 
previous tables, with a few notable changes and surprises. First note that several of the observables 
in Table [7| involve some sort of normalization. In particular numbers 8, 10 and 11. Our estimate of 
the correlations among signatures found that the fluctuations of these normalized signatures tended 
to be less correlated with other observables for that partition than the un-normalized quantities. 
However, normalizing signatures in this way also tended to reduce their ability to distinguish 
models. Signature ^8 is defined as the following ratio 

lets I Jet4 

net = %^ (3.43) 
Ft +Pt 

where pj^^ is the transverse momentum of the i-th hardest jet in the event. For this signature we 
require that there be at least three jets with px > 200 GeV. This signature, like the pT of the 
hardest lepton or the pT of the 4th hardest jet, was effective at capturing the increasing softness 
of the products of cascade decays as the value of a was increased away from a = 0. 

Let us note that Lists A, B and C are not mutually disjoint. For example, signatures 4, 5 and 12 
of Table [7] also appear in Table [H The signature mix is determined by attempting to minimize L^[^ 
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via the formula in (j3.38p while attempting to keep the correlations between any pair of signatures 
below the targets set above in the text. As mentioned earlier, larger lists are not always better 
- the more signatures one adds, the larger the likelihood that some pair will be correlated with 
one another to an unsatisfactory amount. Furthermore, when signatures are added which are 
only occasionally useful, the resolving power of the ensemble can actually be degraded since the 
statistical threshold defined by Amin in Table O grows with the number of signatures. 

We will see some examples of this perverse effect in the next section in which we will examine the 
effectiveness of these three lists. We will do this first against our benchmark models from Section [2] 
and then against a large ensemble of random MSSM model lines. Before doing so let us note that 
by fixing a particular set of n signatures in every instance - and indeed, with the fixed integration 
ranges indicated in the Tables - we are very likely to often be far from the optimal signature mix and 
integration ranges. That is, we should not expect to achieve the absolute Lmin value of Figure S] for 
any particular pair or points along a model line. If we have chosen our signature list well, however, 
then we can hope that the result of adding the contributions of all n signatures using (j3.38|) will 
be close to the optimal Lmin value over a large array of model pairs. 

4 Analysis Results 

In this section we will examine how well our signature lists in Tables [5l [6] and [7] perform in measuring 
the value of the parameter a which appears in (jl.ip . Recall that our specific goal is to distinguish 
between a model with a = and another with all other soft terms held equal, but with a ^ 0. We 
would like to do this with the least amount of data (or integrated luminosity) as possible for the 
smallest values of a possible. We will first demonstrate how the lists perform on our benchmark 
cases before turning to an analysis of their performance on a large ensemble of randomly-generated 
supersymmetric models. 

4.1 Benchmark Models Analysis 

We begin with the theory-motivated benchmark models briefiy mentioned at the end of Section [2] 
and discussed at length in the Appendix. The input values for the soft supersymmetry-breaking 
parameters are listed in Table [T] at the very end of Section [21 To remind the reader, model A is an 
example of a heterotic string compactification with Kahler stabilization of the dilaton while model B 
is an example of a Type IIB string model with flux compactification. Each of these examples predicts 
a particular value of a as a function of other parameters in the theory; specifically, model A predicts 
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476.1 
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ink 
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niA 


1557 
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1495 


540.4 


niuo 


1557 


806.8 






1500 


545.1 


mH± 


1559 


811.1 






1501 


514.6 



Table 8: Low energy physical masses for benchmark points. Low energy physical masses (in units of GeV) 
are given at the scale 1 TeV. All points are taken to have /x > 0. The actual value of tan /3 is fixed in the electroweak 
symmetry-breaking conditions. 

a ~ 0.3, while model B predicts a ~ 1. Further details can be found in the Appendix (and references 
therein), but these details are not relevant for our purposes in this section. 

The input values of Table [1] were evolved from the input scale Ayy = 2 x 10^^ GeV to the 
electroweak scale of 1 TeV by solving the renormalization group equations. For this we use the 
computer package SuSpect |45j . utilizing two-loop running for all parameters except for the gaugino 
masses. For these we use one-loop RGEs only in order to maintain the parametrization for the 
gaugino soft parameters in terms of a given by (jl.ip . Once run to the low scale the physical 
spectra and mixings of the models were computed by SuSpect. The result of this process for our 
two benchmark models is given in Table [51 

From here we performed a simulation using the combined package of PYTHIA + PGS4 as described 
in Section 13. 3i For each of these two models a model-line was generated by varying the parameter 
a from a = to a = 1, in increments of 0.05, while keeping all other soft parameters fixed. 
Along these model lines the gluino soft mass M3 was held constant to set the overall scale, and the 
two parameters Mi and M2 were varied according to the ratios in (jl.ip . For each point 500,000 
events were generated using the LI trigger options in PGS4. After applying further initial cuts as 
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Figure 5: Lmin as a function of a for the two benchmark models. The three shaded regions correspond to 
tlie three signature lists as indicated by the legend. The lower bound of each shaded region indicates the minimum 
integrated luminosity Lmin needed to separate the model with the specified a from a = (top panels) or the predicted 
value of a (lower panels) . The upper bound of the shaded region represents an estimate of the 1 sigma upper bound 
on the calculated value of Lmin caused by statistical fiuctuations. 



described in Section 13.31 the signatures associated with each of the three hsts in Tables O [6] and [7] 
were constructed. We then used the criterion for distinguishabihty described in Section 13.21 to 
determine the minimum luminosity Lmin needed to separate a = from all other points along the 
line. 

The results of this analysis are presented in the top panels of Figure [5l The plot on the left 
corresponds to benchmark model A while the one on the right corresponds to benchmark model B. 
The vertical axis shows the minimum luminosity needed to separate a given a 7^ scenario from 
the unified case of a = 0. The three shaded regions represent the three model lists we used 
to analyze the data. At the lower edge of each region is the value of Lmin as calculated using 
the relations in (I3.37P and ()3.38p . The upper edge of each region represents an estimate of the 
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1 sigma upper bound on the calculated value of L^[^ caused by statistical fluctuations (i.e. the 
fact that the cross-sections extracted from the data or simulation are not the true cross-sections 
for each signature). The lower panels in Figure [5] represent the same analysis, but now each of 
the two models are compared to their predicted values: a = 0.3 for model A and a = 1.0 for 
model B. With the exception of the straw-man List A in the case of benchmark model A, all the 
lists do an adequate job of distinguishing points along these alpha- lines with moderate amounts 
of integrated luminosity. Naturally, as the two points being compared approach one another the 
signature difference between them become smaller and the needed Lmin increases. It is instructive 
to consider the case of model A to understand why some approaches to extracting the parameter 
a succeed and others fail. 

Model A has nearly universal scalar masses at a rather high scale of approximately 1.5 TeV, 
yet the light gluino makes this the model with the higher overall cross-section. All supersymmetric 
observables in this benchmark model are therefore dominated by gluino pair production and their 
eventual cascade decays through highly off-shell squarks. In the analysis the gluino mass is kept 
constant along an alpha-line, so the cross-section for the dominant process gg gg is fixed at 
cr{gg — > gg) = 13.4 pb for this alpha-line. Any signatures related to this variable will depend on a 
only via the change in the gluino branching fractions, which are nearly constant as a function of 



the parameter ajj Blunt signatures like the total M^g variable of (|3.42|) indicate roughly the total 
production cross-section and crude mass scale of the superpartner being predominantly produced. 
This is an example in which the most inclusive possible observable is simply too inclusive to detect 
the change in gaugino mass ratios. For this one must consider processes that produce electroweak 
gauginos, which are subdominant by as much as a factor of ten in the case of benchmark model A. 

Further compounding the problems for the inclusive signature of List A is the fact that the 
count rate for this particular final state is varying only very slowly with a. Despite the fact that 
this count rate can be quite large in this model, the resulting value of L^i^ is high because the 
ASab value for this particular signature is very near zero. As a result, small statistical fiuctuations 
in the data or the simulation result in large fiuctuations in the resulting value of Lmin needed to 
truly separate different values of the parameter a. This reflects itself in both the width of the 
shaded region in the left panels of Figure [5] and in the volatility of the extracted value itself. In 
Figure E] we plot the distribution of the List A variable (j3.42p in benchmark model A for the case 
of a = (solid line) and a = 1 (dashed line). Above our integration cut of 1250 GeV there is very 

'Only the highly suppressed three-body decay g — » Ciqq' with q and q' representing third-generation quarks shows 
any significant dependence on the value of the parameter a for this benchmark model. 
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Figure 6: Distribution of the variable M^g^ from signature List A for benchmarlc model A. Solid filled 
histogram is the case for a = 0, dotted histogram is the case for a — 1. The lower bound for the integration region is 
indicated by the dotted line at 1250 GeV. The sharp lower bound in the distribution is an artefact of the event-level 
cuts imposed on the data as described in Section [3.31 In this case the failure of List A to separate the two cases is 
apparent: the difference between the two histograms is negligible above the value Al^g^ = 1250 GeV. The resolving 
power would improve dramatically if this lower bound was relaxed to — 500 GeV, as demonstrated in Figure [T] 



little difference between the distributions, even for this extreme case. However, it is clear that some 
discrimination power is available had we chosen a different lower bound for integration. When the 
lower bound on this particular variable is relaxed to 500 GeV the inclusive M^s variable becomes 
competitive with the other signature lists, as shown in Figure [71 

Benchmark model A therefore provides us with an example where the procedure of optimiz- 
ing the signature list over a wide ensemble of models has produced a prescription that is most 
definitely not optimal for this particular case. Once a particular model framework is established 
it will of course be possible to tailor analysis techniques to optimize the statistical power of any 
given signature. But for our quasi-model-independent analysis we must forgo optimization in favor 
of generality. Nevertheless, we gain resolving power by simply expanding the list of signatures 
to include those which are more sensitive to the changes in the lower-mass electroweak gaugino 
spectrum. Returning to the left panels of Figure [5] it is clear that Lists B and C do far better at 
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Gaugino Mass Non-Universality Parameter a 



Figure 7: Lmin as a function of a for benchmark model A with relaxed lower bound on M^g^ . The 

three shaded regions correspond to the three signature lists as in the upper left panel of Figure [5] In this case the 
lower bound of the integration range for the single observable of List A has been relaxed to 500 GeV. 



measuring the parameter a than the single Mefj variable alone. For example, the jet invariant mass 
variables in both lists, as well as the normalized f^T signatures and pxiJet^) observable of List C 
are much more sensitive to changes in a for this benchmark model than the observable in (j3.42p . 

But note the reduction in resolving power that occurs when we choose the largest signature list. 
As discussed in Section [3. 3t it is clear that the largest possible signature list is not always the most 
effective at separating two theories. In this particular example many of the additional observables 
in List C are not at all helpful in separating different a values - particularly the counting variables 
for which the total rates are low and the differences across the alpha-line are small. These additional 
variables were designed to be most effective when the mass hierarchies in the superpartner spectrum 
change as the value of a is modified, so that dramatic changes in production rates and/or branching 
ratios occur. Such threshold effects do not occur over the a range probed in benchmark model A, 
but do in fact occur for benchmark model B. This is clearly evident in the right panels of Figure O 
where additional resolving power is obtained when using the expanded signature List C. 

We note that the single inclusive variable of ()3.42p is much more effective in benchmark model B 
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Figure 8: Distribution of the variable M^^^ from signature List A for benchmark model B. Solid filled 
histogram is the case for a — 0, dotted histogram is the case for a = 1. The lower bound for the integration region is 
indicated by the dotted line at 1250 GeV. The sharp lower bound in the distribution is an artefact of the event-level 
cuts imposed on the data as described in Section [3.31 



in part because the production cross-sections for all SU (3)-charged superpartners are roughly equal 
in magnitude. The inclusive Meg variable no longer tracks the mass and decay products of a single 
heavy state so variations with the parameter a are now more prominent. This is shown in Figure [HI 
which should be compared to the case of model A in Figure O Note that the total area under the 
two curves in Figure [8] is nearly identical, highlighting the need to choose a wise value of the lower 
bound on the integration region to achieve a high degree of differentiation. Model B is similar to 
the randomly-generated models we used to design our signature lists and thus the chosen value 
of 1250 GeV for this particular observable is close to what would be the optimal choice for this 
particular model comparison. 

Despite the lower overall cross-section for the supersymmetric signal in benchmark model B, 
the three signature lists succeed in distinguishing the case a = from non- vanishing cases with far 
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Figure 9: Values of {RAB)i for the five signatures of List B as a function of a for benclimark model B. 

The ability of each individual signature from List B to resolve the case a = from the indicated value of a is given 
by the height of the curve {RAB)i in the above plots. In the left panel we display signature 1 (solid curve) and 
signature 5 (dashed curve). In the right panel we display signature 2 (solid curve), signature 3 (dashed curve) and 
signature 4 (dotted curve). 

less integrated luminosity. In large part this is due to the richness of the particle spectrum for this 
model. The superpartner masses given in Table [8] are for the case a = 1. As a approaches zero 
the masses of the lighter neutralinos and lightest chargino fall relative to that of the gluinos and 
squarks (which remain constant). Along this alpha-line several important thresholds are crossed, 
resulting in dramatic changes in the relevant branching fractions for the heavier states. The mix of 
signatures in List B and List C that contribute most strongly to the resolving power of the overall 
list changes as we move along the alpha- line. For example, consider the {RAB)i values of ()3.34p for 
the five signatures of List B. We plot these values in Figure E] for model A corresponding to q = 
and model B corresponding to the indicated value of a 7^ 0. 

To understand these curves, we first note that the dominant SUSY production processes in 
benchmark model B are the pair production of stops and associated production of light squarks 
with a gluino. The branching fraction for three of the more important decay modes of the stop are 
plotted versus the parameter a in the left panel of Figure [TOl For values of a ^ 0.35, when both the 
chargino Ci and the LSP A^i are sufficiently light, the direct two-body decay into the LSP and a 
top quark is dominant. About 50% of the time the W-bosons from the top decays on both sides of 
the events will decay hadronically and the event will be captured by the first observable in List B. 
For the intermediate region 0.35 ^ a ^ 0.6 the stop decays predominantly via ti Cih and the 
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Figure 10: Branching fractions for principal decay modes of lightest stop (left panel) and lightest 
chargino (right panel) as a function of a for benchmark model B. In the left panel the decay modes are 
ti Nit (dashed curve), t\ C\b (solid curve), and t\ —> Nic (dashed curve). In the right panel the decay modes 
are Ci NiW (solid curve) and Ci —^iib (dashed curve). 

final state topology is determined by the subsequent decay of the chargino. The branching fractions 
for the primary decay channels of the chargino Ci are given in the right panel of Figure [TOj In 
this intermediate a region the chargino is decaying primarily to a W-boson, populating all of the 
signatures in List B. 

For larger values of a ^ 0.6 the chargino Ci and the LSP A'^i are now massive enough that the 
only decay channel available for the stops is the process ti Nic, producing and two jets only. 
These events are captured by the second and (especially) fifth observables in List B, as evidenced 
by their rapid growth in significance. For a ^ 0.7 charginos that are directly produced (or produced 
through cascade decays of heavier squarks) will now decay into stops via Ci tib ^ Nicb. This 
boosts the resolving power of the signatures with lepton vetoes relative to the other signatures in 
List B. 

Similar arguments explain the behavior of the expanded list of observables in List C. Here we 
will only take a moment to mention the counting signatures which make their first appearance in 
our analysis. Generally speaking, counting signatures are sensitive only to the total cross-section 
for the final state being counted. Changes in the pT of Standard Model particles produced in 
cascades are washed out, making them less useful for comparing different gaugino mass hierarchies. 
Counting signatures are therefore only effective when the two a values being compared correspond 
to different decay patterns altogether. This happens in several instances in benchmark model B, as 
we indicated above. The counting signatures in List C are specifically designed to consider changes 
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Figure 11: Branching fraction for next-to-lightest neutralino (left) and {RAB)i values for key counting 
signatures from List C (right). The branching fraction of the next-to-lightest neutralino A^2 for benchmark 
model B is plotted as a function of a in the left panel. The decay modes are N2 Nih (dashed curve) and N2 Ni Z 
(solid curve). In the right panel the (RAB)i values for the inclusive leptonic counting signature (signature 1 - solid 
curve) and the inclusive B-jet counting signature (signature 3 - dashed curve) are plotted as function of a. 



in the decay table for the next-to-hghtest neutrahno ^ particularly the appearance of the so- 
called "spoiler" modes for the classic trilepton signal. In the left panel of Figure [TT] the primary 
decay modes of the next-to-lightest neutralino are given. We observe that both of the on-shell 
decays Nih and NiZ are available for this state when a ^ 0.7, with the Higgs mode 

peaking around a ~ 0.6 before becoming kinematically inaccessible. This changeover is reflected 
in the Ri values for the leptonic counting signature and the B-jet counting signature of List C, 
as shown in the right panel of Figure [TTJ Note that the light stop in benchmark model B makes 
this a very B-jet rich point. In fact, this particular counting signature is one of the most effective 
observables in List C along the alpha-line for this point. 

4.2 Analysis of a Large Set of Model Variations 

We next examine the efficacy of our method by testing it on a large sample of varying model points. 
We will do this in two steps: first on a controlled sample of models and subsequently on a random 
collection of model lines. Ranges for the MSSM input parameters and variation steps used for our 
controlled sample are given in Tabled Only M3, rriq, and /i were allowed to vary across 5 
uniform steps. All other soft parameters were held constant. The gaugino universality parameter 
a was also varied in 4 steps from a = 0, to 0.33, 0.66, and 1.0. These choices discretize the range 
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Input Parameter Range 


Variation 


400 GeV > M3 > 800 GeV 


5 steps 


400 GeV > /i > 1000 GeV 


5 steps 


300 GeV > (me^.B,"^fL,fl) > 700 GeV 


5 steps 


500 GeV > {mQ^,mq^,mi^ j^,mi^ j^) > 1000 GeV 


5 steps 


tan ^ = 10 


Fixed 


ruA = 1000 GeV 


Fixed 


^T) -^ti ^6) ^U) -^d — 


Fixed 



Table 9: MSSM soft parameters ranges and variation steps used to generate controlled sample. These 
values are given at the electroweak scale. For each choice of MSSM input, the gaugino unification parameter a was 
varied in four steps, from a = to a = 1.0 

of parameter space into 2500 individual model points. Note that the parameters of Table are 
given at the low-energy electroweak scale. We emphasize the fact that in this first step we have 
chosen to sample the parameter space on a discrete grid rather than sampling it randomly. While 
a truly random sampling is necessary for ultimately testing our method, we here wish to study the 
performance of our signature sets as key parameters are varied. Our discrete grid is designed to 
keep the overall supersymmetric production rate roughly fixed, allowing for a more straightforward 
comparison of Lmin values. This course sampling also allows a large degree of model variation while 
keeping computation time to a minimum. Our analysis of a random collection of models will appear 
at the end of this subsection. 

Simulated data for the model points was generated with the following procedure. For each 
model, the SuSpect partner code SusyHIT [36] was used to compute the low-scale spectrum from 
the input MSSM soft terms. No renormalization group evolution was necessary because the input 
parameters were given at the electroweak scale. As before PYTHIA + PGS4 was used to simulate 
the detector response for each point. A check was performed to ensure that each model point 
had a neutralino LSP, and also that each q > model point simulated had an associated a = 
counterpart, so that the minimum luminosity required to distinguish between the two models, L^i^, 
could be computed. Only models satisfying these requirements were retained for analysis. Exactly 
1449 model pairs (a = and a 7^ 0) were retained after applying this selection procedure. 

Table [TO] gives the dominant production modes across the entire set of model variations. The 
upper table indicates the mode and percentage of models, for a given a choice, that occur with the 
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Largest Production Channel 



Mode 


a = 


a = 0.33 


a = 0.66 


a = 1.0 


99 99 


44.6% 


45.2% 


42.9% 


44.8% 


fg Qr9 


31.1% 


30.2% 


33.1% 


35.7% 


fO — <lLf) 


2-1.3% 


25.5% 


23.9% 


19.4% 


Second Largest Production Channel 


Mode 


a = 


a = 0.33 


a = 0.66 


a = 1.0 


99 99 


2.7% 


2.1% 


2.8% 


1.4% 


fg QR9 


42.0% 


48.8% 


47.5% 


45.2% 


fg qL9 


42.0% 


47.1% 


49.6% 


53.3% 


fifj ~^ X2X1 


13.2% 


1.9% 







Table 10: Dominant production modes across all model variations. At a given a choice, the upper table 
indicates the percentage of models for which these modes had the largest cross section, while the lower table indicates 
the percentage for which the modes had the second-largest cross-section. All models exhibit predominantly gluino 
pair production, or gluino-quark associated production. A small fraction of a = models exhibit neutralino-chargino 
pair production. This mode 'switches off' as a is increased from zero, as the gaugino masses increase. 

largest cross-section. The lower table gives the same information for the modes that occur with the 
second-largest cross-section. The majority of models exhibit squark-gluino associated production, 
or gluino pair production as the dominant production mechanism. Approximately 13% of a = 
models, and about 2% of a = 0.33 models have neutralino-chargino production as the second most 
dominant mode. 

The particle decay behavior varies throughout the range of model simulations. However, gluino 
decays are largely insensitive to changes in a. For the case a = 0, approximately 68% of models 
have g ^ xf ''fQQ' ^ the primary decay channel (the channel having the largest branching fraction), 
while 31% of models have instead g ^ bi + b as the primary channel. The a = 0.33 and 0.66 models 
exhibit similar ratios. The a = 1.0 models show a slight variation, with the distribution shifting to 
70% and ~30% respectively. For all a values, approximately 68% of model variations also exhibit 
g ^ xf + qq' as the dominant secondary channel (having the second-largest branching fraction), 
while 30% have decays to an on-shell second-generation squark -|- quark as the secondary channel. 

The first- and second-generation squark decays are equally insensitive to variations in a. For 
all a, approximately 50% of models indicate ql ^ g + q is the primary decay channel, while the 
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other 50% have Xi as the primary channel. This is also the dominant secondary channel in 48% 
of the models. Another 40% have X2 + 9 as the secondary channel. The qr are slightly different, 
with approximately 62% of models indicating g/j — > g + g as the primary channel, and another 
37% qn ^ Xi + Q- This is also the dominant secondary channel in 63% of models, with X2 + 9 
secondary channel for another 32%, and the remaining 5% having Qr ^ g + q- 

Due to dependence on the gaugino mass parameters, the chargino decays are significantly more 
sensitive to variations of a. For the a = case, approximately 74% of models have xf ~^ + Xi 
as the primary decay channel. Another 25% have — > Xi + ^ + <?' as the primary channel 
(here the quarks are from the first or second generation), while the remaining 1% have instead 

— > Xi + + ^T- As a increases these three decay channels persist, however their distribution 
across each set of models begins to change, and additional channels begin to appear. For the 
a = 0.33 case, the above channels occur in 65%, 31%, and 1% of models, respectively. However, 
now the remaining 3% of models have Xi + i't as the primary channel. The x^ Xi + ^ + 

channel is the dominant secondary channel for all a variations. 

The X2 decay behavior is similarly diverse. For case q = 0, approximately 39% of models have 
X2 ^ Xi + qq as the primary decay channel, while 23% have X2 ~^ Xi + 28% have X2 ^ Xi + ^'^j 
and another 10% have X2 ^ Xi + t~^t~ as the primary channel. This distribution shifts slightly for 
a = 0.33 to 40%, 26%, 18%, and 13%, respectively. Here, another 3% of models have X2 ^ "^i^ + 
as the dominant channel. For a = 0.66 it is shifted further to 46%, 18%, 15%, 15%, where here the 
remaining 3.4% of models now having X2 ~^ ^eL + as the primary channel. For a = 1.0, the 
and decays occur less frequently, with only 8% and 5% of models having these as the primary 
channel. The x? + qq^ Xi + T^r~, and f + r channels appear with the largest branching fraction 
in 56%, 19%, and 9% of models, respectively. 

As with the benchmark models, we analyze the 1449 model pairs using the three signature sets 
given in Tables O [6] and [71 Due to the large number of model points we present results statistically 
in the form of the observed distribution of L^[^. Table [TT] shows the minimum luminosity required 
to distinguish between models with a = and those with a ^ when using, respectively, signature 
Lists A, B and C. Considering the case of a = 0.33 first, signature List A is able to successfully 
resolve a large number of model pairs with fairly low luminosity. However, only 241 out of the 469 
model variations analyzed for this value of a can be resolved with less than 10 fb^^. Signature 
Lists B and C exhibit significantly stronger resolving power, with List B able to distinguish 429 
variations, and List C 461 out of the 469 total model variations considered. Both Lists B and C 
allow the majority of model variations to be distinguished with < 4 fb~^ integrated luminosity. 
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Table 11: Minimum integrated luminosity Lmin to separate a = from a 7^ in controlled model 
sample. Distribution of Lmin values for the three signature sets of Tables [S] |S] and [7] In each case we are comparing 
the indicated value of a with the case a = Q for the same set of background model parameters. 

however List C exhibits the best performance overall, as it is able to distinguish the models with 
a consistently lower luminosity requirement. For the a = 0.66 models, all three signature sets 
allow the majority of model points to be distinguished from a = with less than 4 fb~^ integrated 
luminosity. Only List A was unable to resolve all model variations with less than 10 fb~^, as 73 out 
of 496 models required higher luminosity. Signature List C exhibits the best performance, allowing 
nearly all model variations to be resolved with < 2 fb~^. The a = 1.0 models are sufficiently 
different from the a = case that all three of the signature sets are able to distinguish the two 
cases with exceptionally low luminosity. Signature List C again exhibits the best performance, 
allowing all models to be distinguished with less than 1 fb^^ of data. 

We can understand these results by examining the individual {RAB)i response of each signature. 
From equation (|3.36|) . the minimum luminosity required to distinguish two models, A and B, is 
inversely proportional to Rab^ which is the sum of the individual {RAB)i values of each signature. 
Because {RAB)i reflects the sensitivity of the i-th signature to changes between models A and B 
(a larger {RAB)i value being more sensitive), signatures that have high sensitivity to physical 
changes associated with a provide a greater contribution to the total Rab, and thus reduce the 
-^min requirement. 

The distribution of Rab values for the single signature of List A is shown in Figure [T2l For 
the a = 0.33 case the distribution is localized to relatively low values of Rab- For the a = 0.66 
and a = 1.0 cases the distribution begins to spread out, with many models having significantly 
larger Rab values. This indicates the signature is becoming increasingly more sensitive to the 
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Figure 12: Distribution of (Rab) values for signature List A. The distribution of Rab values for the single 
signature of List A is given for the parameter sets a — 0.33, a — 0.66, and a = 1.0. In each case we are comparing 
the indicated value of a with the case a = for the same set of background model parameters. Note that larger 
values of Rab imply lower values of Lmin. 



differences brouglit on by changes in a as this parameter is increased. However, with only a single 
signature it is not possible to guarantee that it will be as effective for other models as it is in this 
example. In order for this approach to work across a broad range of potential physics scenarios it 
is advantageous to adopt a combination of signatures, where each may be sensitive to one or more 
aspects of a particular class of models. 

Figures [T3l and show the distributions of {RAB)i obtained for the five signatures of signature 
List B. Each figure represents five histograms where the variable being considered is logiQ[(Rab)i], 
with the comparison being between a = and a = 0.33 in Figure [13] and between a = and 
a = 1 in Figure [TH In a similar fashion to the single signature of List A, the distributions are in 
general clustered at low {RAB)i for a = 0.33, and begin to spread out considerably, taking on much 
larger values as a increases. Comparing the individual distributions to those in the single signature 
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Sig4 

SigS _4 

Figure 13: Distribution of {RAB)i values for signature List B [a = 0.33 versus a = 0]. The distribution 
of {RAB)i values for the five signatures of List B is given for the case of comparing a — with a — 0.33. For the 
definition of the five signatures, see Table |6] 

of List A, the overall spread of values is not significantly different. However, recall that Rab is 
the sum of the individual {RAB)i values. Therefore we gain a significant enhancement by simply 
including additional signatures. A similar effect occurs with the larger set of signatures in List C. 
As we saw in Section [3l however, there is ultimately a point of negative returns and a maximum 
efficacy is obtained. 

Thus far we have presented the results of our approach in terms of the minimum integrated 
luminosity required to resolve two model classes (a = and a ^ 0) using our set of optimized 
signatures. To understand why this approach works, it is useful to examine the signature results 
themselves. Figures [15] and [16] show examples of two-dimensional slices of the signature space 
"footprint" for our large set of model variations. In these figures the results have been normalized 
to 5 fb^^ of data. 

Figure [15] compares the count rates for the third and fourth signatures of List B for the case 
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Sig4 

SigS _4 

Figure 14: Distribution of {RAB)i values for signature List B [a = 1 versus a — 0]. The distribution of 
(-Ras)j values for the five signatures of List B is given for the case of comparing a = with a = 1. For the definition 
of the five signatures, see Table [6l 

a = versus a = 0.66 (left panel) and a = 1 (right panel). Figure [TBI compares the count rates for 
signatures ^^ll and ^^13 of List C for the case a = versus a = 0.66 (left panel) and a = 1 (right 
panel). In this case the two signatures are both taken from the set of events containing at least one 
lepton and five or more jets (see Table [7]). We have chosen this pair for the dramatic separation 
that can be achieved, though similar results can be obtained with other pairs of signatures. 

The power of our inclusive signature list approach lies in the choice of signatures and their 
ability to remain highly sensitive to changes in the physical behavior of each model. This feature 
is reflected qualitatively in the visual clustering of the data points, which become progressively 
more distinct as the parameter a is increased. As the regions separate it becomes increasingly less 
likely that a model from one class can be confused with a model from the other class, even when 
considering statistical fluctuations. In our approach this manifests itself when one computes Rab, 
which reflects the "distance" in signature space between the two models under comparison, and 
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Figure 15: Footprint-style plot for a pair of signatures from List B. Total counts for signature #3 versus 
signature #4 of List B is given for tlie case a = (green triangles) a ^ (black squares). The cases shown are for 
a — versus a = 0.33 (top panel), a = 0.66 (middle panel) and a = 1 (bottom panel). The axes measure the number 
of events for which the kinematic quantity was in the range given in Table |6l Larger values of the non-universality 
parameter a correspond to a greater degree of separation between the two model "footprints." 
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Figure 16: Footprint-style plot for a pair of signatures from List C. Total counts for signature =1^11 versus 
signature #13 of List C is given for the case a = (green triangles) a ^ (black squares). The cases shown are for 
a = versus a = 0.33 (top panel), a = 0.66 (middle panel) and a = 1 (bottom panel). The axes measure the number 
of events for which the kinematic quantity was in the range given in Table [T] Larger values of the non-universality 
parameter a correspond to a greater degree of separation between the two model "footprints." 
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which becomes large when the models are sufficiently different from one another. 

The idea of using repeated pairings of targeted observables in order to separate model classes 
was studied in previous "footprint-style" analyses [171 |38l |39] . If we consider the universal gaugino 
mass scenario {i.e. a = 0) as a "model," and the case of non-universal gaugino masses as a separate 
model, then a set of signatures will be truly targeted at this particular model feature if the set of all 
such two-dimensional planes implies complete separation between the models. With this in mind it 
is interesting to examine distinguishability between the two values of a from a somewhat different 
perspective. Adopting the approach of [39] we can ask how many degeneracies exist between the 
two classes of models, where by degeneracy we mean two models that exist at different points in 
the microscopic parameter space, but occupy the same point in signature space (up to statistical 
fluctuations). If it is possible, through application of one or more signatures, to ensure that no 
degeneracies exist we can claim to that it is possible to completely discriminate between the two 
classes. 

As an example of how this idea can be applied, we can consider the analysis performed in [49|. 
Let one particular value of the parameter a (such as a = 0) be "model A" and let some other 
value of the parameter a be "model B." Choose any pair of signatures in one of the signature lists. 
From our controlled sample we can choose an individual case Bj G B and compute the quantity 
{/S.SAiBj)'^ between that particular point and all the points Ai G A for this pair of signatures. If the 
value for all such (A^a^Bj)^ is always greater than the two-signature threshold given by 72(0.95) 
in Table [2] we will claim the point Bj has been separated from the entire footprint of model A. We 
can then repeat this exercise over all cases of model B. The number of cases of model B that have 
not been separated from the entire footprint of model A we will denote as Nba- This is a type of 
degeneracy count for model B with respect to model A. Clearly the process can be performed for 
model A with respect to model B, producing a degeneracy count Nab- In general we expect these 
two numbers to be roughly equivalent in magnitude, but not necessarily precisely equal. 

If either Nab or Nba are non-vanishing then the two footprints are not yet disjoint in the 
multi-dimensional signature space. We can then choose any other pair of signatures and repeat 
the procedure, this time restricting Ai and Bj to run only over the degenerate cases. If we have 
chosen a good set of signatures the quantities Nab and Nba should rapidly converge to zero as the 
algorithm is successively applied. The results of performing this exercise on the controlled model 
sample generated by the parameters of Table [9] is shown in Figure [T71 In the left panel we show 
the successive values of Nab and A'^^^ as pairs of signatures from List B are used to compute the 
separability parameter {IS.Sab)'^) while the right panel uses pairs of signatures from List C. In both 
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Figure 17: Degeneracy counts for List B (left panel) and List C (right panel). The relative degeneracy 
counts Nab and Nba that result from successive application of pairs of signatures from List B and List C are plotted 
for our controlled model sample. In each case model A is the case with a = while model B is the case with the 
indicated value of a 7^ 0. Once all model pairs have been applied the total degeneracy count vanishes for both lists 
and for all values of a 7^ 0. 



cases "model A" represents the set of models with a = 0, while "model B" represents the case with 
the indicated value of a = 0.33, 0.66 and 1.0. For all three values of the parameter a the lists do 
an excellent job of converging towards Nab = Nba = after only a few pairings are considered. 
This suggests that the signature lists of Tables [6] and [7| should be able to reveal the departure of 
the gaugino soft masses from the universal ratios on a truly general super symmetric model with a 
high degree of reliability and in a small amount of integrated luminosity. 

To honestly confirm this hypothesis we must generate a more random set of models. After all, 
the signature lists of Tables [H [6] and [7] were constructed precisely with the sorts of models of our 
controlled sample in mind. But as we saw in Section 14.11 models such as benchmark model A can 
prove more challenging for our analysis algorithm. To allow for the possibility of more perverse 
cases than those of our controlled sample, an additional set of 500 models were generated with six 
points on the a-lines ranging from to 0.5. In this case a 16-dimensional parameter space defined 
by the quantities in (j3.18p was considered. Specifically, slepton and squark masses were allowed 
to vary in the range 300 GeV to 1200 GeV with the masses of the first and second generation 
scalars kept equal. The gaugino mass scale given by M3 and the ;U-parameter were also allowed 
to vary in this range. The pseudoscalar Higgs mass ruA was fixed to be 850 GeV and the value 
of tan (3 was allowed to vary from 2 to 50. If all points along the a-line satisfied all experimental 
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Figure 18: Efficiencies of the three signature hsts. The ability of the three signature lists to separate the case 
a — 0.1 from a = is indicated in the top pair of plots and the simpler case a = 0.3 from a = in the bottom pair 
of plots. On the left, the percentage of cases that could be distinguished using each of the three signature lists of 
Tables [5l [6] and [7] is given as a function of integrated luminosity in units of fb~^. On the right the same percentage 
is shown as a function of the number of supersymmetric events. The 95% separability threshold is indicated by the 
dashed horizontal line. 



constraints on the superpartner mass spectrum, then 100,000 events were generated for each of the 
six points along the a-hne in the manner described in Section [3l Using this data the value of Lmin 
was computing using (|3.37p and (|3.38p for each of our three signature sets. 

The results of this analysis are given in Figures [18] and [T9j Figure [18] considers the ability of 
our signature lists to separate the case a = 0.1 from a = (top pair of plots) and the simpler 
case a = 0.3 from a = (bottom pair of plots). On the left, the percentage of cases that could 
be distinguished using each of the three signature lists of Tables [5] [6] and [7] is given as a function 
of integrated luminosity in units of fb~^. Since the random model sample includes examples with 
very different superpartner mass scales, the overall supersymmetric production cross-section varies 
much more across this sample than in the controlled model sample described above. We therefore 
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Figure 19: Lmin and A'^niin required to detect a 7^ for 95% of the random models. 



take this into account by plotting the same percentage in terms of the number of super symmetric 
events on the right side of Figure [HI The 95% separabihty threshold is indicated by the dashed 
horizontal line. Even using our best set of signatures (List C) it will require nearly 100 fb"^ to 
be able to detect non-universality at the level of a ~ 0.1 for an arbitrary supersymmetric model. 
Yet for the vast majority of models the departure from universality should become apparent after 
just 10-20 fb^^. Departures from universality at the level of a ~ 0.3 should be apparent using 
this method for most supersymmetric models after just a few fb^^. In Figure [19] the integrated 
luminosity (or number of supersymmetric events) needed to detect a 7^ for 95% of our random 
models is given as a function of the five non- vanishing a values simulated. 



5 Conclusions 

If supersymmetry is discovered at the LHC the high energy community will be blessed with a large 
number of new superpartners whose masses and interactions will need to be measured. At the 
same time the community will be cursed by a large model space with many Lagrangian parameters 
which cannot themselves be directly measured experimentally. Undoubtedly performing global fits 
of the many observables to the parameter space of certain privileged and well-defined benchmark 
models will be of great help in making sense of this embarrassment of richness. But recent work 
suggests that unless these models are determined by very few parameters it is likely (if not perhaps 
inevitable) that multiple points in the parameter space will fit the data well. It then becomes an 
interesting question to ask whether it is possible to fit to certain model characteristics rather than 
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to any particular model itself. 

In our opinion one of the most important such characteristic is the pattern of soft supersymmetry- 
breaking gaugino masses. No other property of the low-energy soft Lagrangian is more easily linked 
to underlying high-scale physics, particularly if that high-scale physics is of a string-theoretic ori- 
gin. Only the related issue of the wavefunction of the LSP is of more importance to low-energy 
physics and cosmology. We are thus interested in asking whether we can identify the presence 
on non-univcrsalitics in the gaugino sector independent of all other properties of the supcrpartncr 
spectrum. The manner by which any such undertaking can be tackled is by no means clear - though 
neither is it clear that such an undertaking is inherently impossible. In the present work we have 
decided to begin this process with a simple parametrization of the gaugino masses determined by 
a single parameter which can be thought of as the ratio of bulk gravity and anomaly contributions 
to gaugino masses. We developed model "lines" in the spirit of previous benchmark studies such 
as the Snowmass Points & Slopes in which only the single non-universality parameter is varied. 
By understanding how the observable physics at the LHC is affected by this parameter - and then 
repeating the analysis many times with the other supersymmetric parameters varied - wc can learn 
which LHC signatures are most directly "targeted" at this important underlying characteristic. 

Our procedure depends on certain analytic results that improve on the methods first introduced 
concretely by Arkani-Hamed et al. These analytic results in turn depend on the assumption that 
the signatures considered have fluctuations which are largely uncorrelated with one another. This 
severely limits the type of signature ensembles one might construct. Yet this restriction does not 
imply a loss of resolving power, as the "optimal" signature list is rarely the largest possible list 
one can imagine. Our analysis has suggested two signatures ensembles which perform remarkable 
well at the task of measuring the value of the non-universality parameter we introduce. Broadly 
speaking, we find that a non-universality at the 10% level can be measured with 10-20 fb~^ of 
integrated luminosity over approximately 80% of the supersymmetric parameter space relevant for 
LHC observables. If we are interested in measurements at only the 30% level these numbers change 
to 5-10 fb~^ over approximately 95% of the relevant parameter space. 

This is remarkable progress, but the task wc set out for ourselves is admittedly still somewhat 
artificial. There are two independent mass ratios that can be constructed from the three soft 
supersymmetry breaking gaugino masses - our parametrization is therefore not fully general. It 
would be of great interest to study more general departures from non-universality to see if the 
optimal signature lists change substantially. Of greater import is the need to perform a Monte 
Carlo simulation in order to compare a candidate model to the "data" at the LHC. To perform 
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such a comparison we must assume knowledge of all input parameters apart from the one we are 
attempting to measure. While this is a common practice in benchmark studies at colliders, it is 
far from the reality that theorists and experimentalists will encounter in the early stages of the 
LHC era. Our study demonstrated the efficacy of certain targeted observables in extracting the 
non-universality parameter a while keeping all other parameters fixed for the two models. This is 
quite a strong assumption and future work should relax this constraint. In other words, one would 
like to distinguish between two models (with different values of a) even if the other parameters 
for the two models are not the same. There are many directions by which this may be pursued. 
For example, in the current analysis we have not allowed ourselves any knowledge of the mass 
spectrum, though analysis of kinematic end-points will certainly provide some information in this 
regard early on in LHC data-taking. In addition, techniques such as the use of on-shell effective 
theories [50] might provide sufficient information about the dominant production and decay modes 
for new mass eigenstates to allow an approximation to our analysis to be performed before the full 
mass spectrum is reconstructed. We hope to pursue both avenues for introducing greater realism 
in future work. 
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Appendix: Some Specific Examples 

The low energy limit of four-dimensional string constructions can be studied as a supergravity 
theory defined by three functions of the massless chiral superfields: the Kahler potential K, the 
superpotential W and the gauge kinetic function fa- As in the previous subsection the label a 
refers to a particular gauge group Qa- This last function is naturally of most import for the soft 
supersymmetry breaking gaugino masses, but all three functions play a role in determining the 
nature of supersymmetry breaking in the observable sector. In string models the gauge kinetic 
function is typically determined by gauge-singlet chiral superfields which we will simply refer to 
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as moduli. The Kahler potential and superpotential are generally functions of both moduli and 
gauge-charged matter super fields. 

Let us assume, as is so often the case at tree-level in string theory models, that the gauge kinetic 
function is linear in the modul0 

fa = kaXa. (A.l) 

Here Xa represents a generic modulus field and we have allowed for the possibility that each gauge 
group can have its own modulus dependence. The proportionality constant ka can be thought of 
as the affine level at which the gauge group Qa is realized in the underlying conformal field theory. 
We will hereafter always set this constant to unity. Note that the real part of the lowest (scalar) 
component of Xa must acquire a vacuum expectation value (VEV) in order to determine the size 
of the corresponding gauge coupling 

< Re a:a > = ^ = ^, (A.2) 

where Xa = ^ale=e=o- If wish to entertain the notion of gauge coupling unification then we 
must either arrange these VEVs to be equal, or there must be a single universal modulus X to 
couple to all gauge groups with equal strength. The latter is the case for the two models we will 
consider below so we will assume 

fa = X; <Rex> = l/5|^^, (A.3) 

where i^str is the universal gauge coupling at the string scale. The highest component of the chiral 
superfield X is the auxiliary field . A non-zero expectation value for this field, or indeed of 
any other such auxiliary field, is an indication of spontaneous breaking of supersymmetry. The 
manifestation of this supersymmetry breaking in the form of gaugino masses is given (at tree level) 
by the expression 

Ma = ^F^'dNfa, (A.4) 

where the repeated index n sums over all chiral superfields present in the function fa and the 
expression on the left is understood to be evaluated in the vacuum. For the case of (jA.Sp this 
implies 



Ma 




(A.5) 



^Unless explicitly written otherwise, in this appendix we are always using Planck units in which Mp, 
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where in the last hne we have made exphcit the vacuum evaluation at the string scale. Now we 
have arrived at a universal contribution to the three gaugino masses of the Standard Model, which 
gives rise to the term M^(Auv) of (|2.3p . 

Additional contributions to ()A.4p appear at the loop level. The structure of these terms have 
been computed elsewhere [351 IMl EI] and they generally depend on details of the complete theory 
beyond the form of the tree level gauge kinetic function. A subset of these terms can be derived 
completely from the superconformal anomaly, the most important of which is universal for any 
supergravity theory 

' 2 167r2 3 ^ ' 

The coefficient ha is as defined in (j2.5p and the field M is the auxiliary field of the supergravity 

multiplet whose expectation value determines the gravitino mass 771,3/2 = —\ (-^)- -'^^ limit 
where this is the only significant one loop correction to the gaugino masses we recover the expression 
in (|2.4p where Mg = m^i2- 

We now have our two components to the mirage gaugino mass pattern. Our next task is to ask 
how the magnitudes of M„ = (p-^ / {x + x)^ and Mg = — /3 might be related to one another. 
As both P-^ and M are auxiliary fields their equations of motion are easy to obtain, relating these 
quantities to the Kahler potential and superpotential via 

F^' = -e'"^K''^ iWj^ + Kj^w), M = -3e^/2T^ (A.7) 

with Wjj = dW/dZ^ , Kj^ = dK/dZ^ and K^'^^ being the inverse of the Kahler metric K^^j^ = 
d'^K/dZ^^ dZ^ . Here Z^ represents any chiral superfield, including our particular modulus X from 
the gauge kinetic functions. Given a specific model of supersymmetry breaking - such as gaugino 
condensation - the modulus dependence on the non-perturbatively generated superpotential terms 
can be computed and ()A.7p can be used to explicitly relate the size of the gravitino mass to the 
size of (^F-^^. However, if we make the assumption that the scalar potential has vanishing vacuum 
expectation value in the ground state of the theory then we can bypass this complication and use 
this assumed constraint directly [52]. The scalar potential is given by 

V = Kj^j^F^'f" - ^MM (A.8) 

where repeated indices are again summed. The condition {V) = immediately relates the auxiliary 
F-terms to the gravitino mass. In particular, if F^ is the only non-vanishing i^-term component 
in the theory then we have 

y) = V3m,/2({K.,)-'^') , (A.9) 
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up to a possible phase. For the moduli we will consider in this paper the tree level Kahler potential 
is typically Ktj:ee{X, X) = — ln{X + X) and thus the imposition of vanishing scalar potential in the 
vacuum implies 

^" = ( (^Ti) ) = ^""3/2 = VsMg . (A.io) 

Clearly such a situation will not result in the ratio r = Mg/M^ ~ 0(10 — 100) and therefore if (jA.lOp 
holds the contribution from (|A.6P will be only a small perturbation on the universal contribution 
from (lA.Sp . 

But this is where a thorny "problem" for string phenomenology becomes an opportunity. The 
problem is that the vast majority of simple, explicit models of dynamical supersymmetry breaking 
(such as the gaugino condensation mentioned above) do not produce vanishing vacuum energy. In 
other words, when the values of and (^^'^'^ are computed from first principles via (|A.7p the 
relation in (jA.Oh typically fails to be true. This is often considered an embarrassment for string 
models and much effort in string phenomenology is devoted to stabilizing moduli and breaking 
supersymmetry while simultaneously achieving {V) = 0. While many solutions have been postu- 
lated through the years, we can group them here into two broad classes. In the first class the 
simple structure of the scalar potential in (lA.Sp is retained, with a single modulus carrying non- 
vanishing auxiliary VEV, but the Kahler potential is assumed to differ from the tree level form 
Ktj-eeiX, X) = — ln{X + X) so that (jA.lOp is modified and {V) = is obtained. For the second class 
a new sector is brought into the theory to produce a new contribution to the scalar potential 6V of 
approximate magnitude 6V ~ "^3/2 • ^^^^ sector does not interact with the observable sector 
then its sole impact is to approximately cancel the large (negative) vacuum energy associated with 
the second term in (jA.Sp . leaving (^P'^'^ essentially disconnected from the size of the gravitino mass. 
As we will see below, explicit examples of both classes of solutions have the remarkable property 
of giving rise to the same general pattern of gaugino masses as in (I2.16p . 

A.l Class 1: Kahler Stabilization Models 

As an example of the first class of models we will consider the weakly coupled heterotic string models 
studied by Binetruy, Gaillard and Wu (BGW) [MlIM! and reviewed in [29]. The presentation here 
will follow that of [37] from which we will take our benchmark scenario. 

For the heterotic string gauge coupling unification is a result of a single modulus, the dilaton 
5, appearing universally in all gauge kinetic functions. The BGW construction postulates the 
existence of some non-perturbative correction to the action for the dilaton field, along the lines of 
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that originally suggested by Shenker [55], which results in a modification of the Kahler metric for 
the dilaton scalar. Borrowing the notation of (|A.9p with X ^ S" it is sufficient for our purposes to 
parameterize this modification as follows 

= V3m,/2(.Ks,)-'/' = V3m,/,a^^{Klr)-^/^, (A.ll) 

where we have introduced the parameter 



tree 



1/2 

I (A.12) 



designed to measure the departure of the dilaton Kahler potential from its tree level value. Recall 
that {(Klr)'^') = {l/{s + s)) = glJ2 1/4 and s = S\s=e=o- 

In order to be more concrete we must build a model for supersymmetry breaking in which ttnp 
is calculable. Here we will take an indirect approach. Consider the field-theoretic non-perturbative 
phenomenon of gaugino condensation. Using the relation between the dilaton and the gauge cou- 
pling it is easy to see that the effective superpotential generated by the gaugino condensate will 
have the form W{S) oc (e-(8'^Vfea)5)3 

were ba is the beta-function coefficient of a condensing gauge 
group Qa of the hidden sector. Let us simplify things by assuming a single condensing gauge group, 
which we will denote by with beta-function coefficient 6+ = 6a/167r^. The values of can be 
quite a bit larger than analogous values for the Standard Model groups, but a limiting case for the 
weakly coupled heterotic string is that of a single £"8 gauge group condensing in the hidden sector, 
so that = Qes and 6+ = OO/lGvr^ = 0.57. Clearly we must insist 6+ > in order for gaugino 
condensation to happen at all. 

However, if we do not insist on the tree level dilaton Kahler potential then the vanishing of the 
vacuum energy implies [56] 



26. 



3 = ^ ^ (A.13) 

1 - ^b+Ks 



where we have used the equations of motion (|A.7p for and W{S) = . So provided 

Kg ~ so that Ksb+ <C 1 we can immediately see that a Kahler potential which stabilizes 

the dilaton while simultaneously providing zero vacuum energy will necessarily imply a suppressed 
dilaton contribution to soft supersymmetry breaking. Indeed, from ()A.12p 



2 ffsTR 5 



1 - iK^b, 
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b+ 



Figure 20: Effective value of a as a function of b+ in the BGW class of models. The parameter b+ 
represents the beta-function coefficient of the largest gauge group which experiences gaugino condensation in the 
hidden sector. This parameter controls the effective value of a for the gaugino masses at the electroweak scale. Since 
the largest possible confining group would be Es there is a minimal size to the effective a parameter in this class of 
theories. The benchmark point considered in the text corresponds to b+ — 36/167r^. 



and 

r = M,/M.. = ( (£±f) ) = ^ » 1 ^ (A.15) 

It is not hard to construct explicit examples which achieve the outcome in ()A.13P and ()A.14p [57\ [58] . 
The value of the parameter a associated with ()A.15P can be readily computed from (j2.15p 

1 

a = r . (A.16) 

Using ()A.14p and the assumption that (Kg) = —g1,^^/2 we can plot the predicted value of a as 
a function of condensing group beta- function coefficient bj^. The result is shown in Figure [20l 
Note that the largest possible value of 6+ (6+ = = 90/167r^ = 0.57) corresponds to the smallest 
possible a value. We immediately see that if this class of models is realized in Nature then the a — > 
limit cannot be obtained and departures from the mSUGRA gaugino mass regime are a prediction 
of the theory. Our benchmark point will take 6+ = 36/167r^ which corresponds to a^p = 1/15.77. 
Such a value for could arise from a condensation of a sector consisting only of Eq Yang-Mills 
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fields and no matter charged under the Eq group. This benchmark point was studied in [37J and 
we give the expHcit soft supersymmetry breaking mass parameters for the point in Table [U at the 
end of this Appendix. 

The corresponding effective a value at the scale Aew = 1 TeV is a = 0.28. We note that 
extraction of the value of a from low-scale gaugino soft masses depends on the renormalization 
group scale, as is apparent from expressions such as (|2.13|) and (j2.16p . In particular, the value of 
a = 0.28 can be extracted using the ratios in (jl.ip provided the gaugino masses are evaluated at 
the scale Aew = 1 TeV. The value of a at other scales can be found by using the more general 
formula (|2T3D . 

A. 2 Class 2: Type IIB Models with Flux Compactifications 

The second type of solution to the vacuum energy problem - introducing a new sector whose pur- 
pose is to cancel negative contributions to the vacuum energy arising from the last term of (|A.8P 
- is realized in certain constructions of Type IIB string theory compactified on Calabi-Yau orien- 
tifolds [27]. In this class of theories NS and RR three- form fluxes are introduced to stabilize many 
of the moduli upon compactification. The presence of this flux warps the bulk geometry of the 
Calabi-Yau, resulting in a "throat" of the Klebanov-Strassler type [59j. At the infrared end of this 
throat a hidden sector gaugino condensate exists on a set of Dy-branes and is thus "sequestered" 
from the observable sector, in the language of Randall and Sundrum For gauge theories living 
on D'r branes the gauge coupling is determined by the Kahler modulus T, as opposed to the dilaton 
S of the heterotic example presented previously. But apart from this small notational change much 
of the phenomenology is strikingly similar to the example in the previous subsection. 

The Kahler potential for the modulus T is again taken to be if = Kt^ee(T,T) = — ln(T + T) 
and we assume that the Standard Model exists on a second collection of Dj branes such that the 
(tree level) gauge kinetic functions for the Standard Model gauge groups are universal and of the 
form fa = T. In the effective supergravity theory just below the string compactification scale, 
the presence of the three-form fluxes is represented by a constant Wq in the effective superpoten- 
tial. Combined with the effect of gaugino condensation in the hidden sector the total effective 
superpotential is then 

W = Wo + J2^ie~'''^ ■ (A.17) 

i 

For simplicity, let us assume a single condensate from the gauge group with coefficients = 1 
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and a = a_|-. To make contact with the notation of the previous section we need merely identif}0 

a, ^ A . (A.18) 

Minimizing the resulting scalar potential V{t,t) with t = T\g^g^Q generates a non-vanishing value 
for (t + 1) at which the auxiliary field F'^ vanishes |60j . Restoring the Planck units to the second 
term in (jA.SP we see that the vacuum must therefore have an energy density given by {V) = 
—3m'^^2^PL- The size of the VEV for Ret, as well as the size of the gravitino mass 
determined by the size of the constant term Wq in ()A.17p . In particular we have [60] 

(a+Ret) ~ ln(l/t^o) 

Wo 

(2(Ret))3/2 ■ 



"^3/2 - ^^PLTTTTJTTT^ ■ (^-19) 



An acceptable phenomenology requires that the constant Wq be finely-tuned to a value Wq ~ 
0(10"^^) in Planck units. That such a fine-tuning is possible at all is a particular feature of 
Type JIB compactifications with three-form fluxes. Combining the two relations in ()A.19p we see 
that the model will assume an appropriate value of Wq such that 

(a+Re t) ~ ln(MpL/m3/2) . (A.20) 

The remaining component to the model is the sector that resolves the issue of the large negative 
vacuum energy. Here it is postulated that at the far tip of the Klebanov-Strassler throat there is 
an additional source of supersymmetry breaking. In this case we assume the presence of Ds-branes 
which break supersymmetry explicitly. Being at the end of the warped throat the effect of this hard 
supersymmetry breaking is presumed to be mild on the observable sector D7-branes. The vacuum 
stabilization for the Kahler modulus t = T|g^g^Q is thus largely unaffected. Being an explicit 
breaking of supersymmetry it is not possible to perfectly capture the effects of the Z)3-branes in 
the form of corrections to the supergravity effective Lagrangian in superspace. However, it can be 
approximated [611 [38] by assuming a correction to the pure-supergravity part of the action 



£ 9 -2 y d^dE ^ -2 y d*^ p + P{T, T)J (A.21) 

which gives rise to a new contribution to the scalar potential for the modulus T. When the modulus- 
dependence of P{T, T) is trivial and P{T, T) = C then the resulting scalar potential contribution 



^We have changed notation so as to ease comparisons with the original literature. 
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is simply 

H,» = ^. (A.22) 

and the more general case of P{T, T) = C(T + T)" gives rise to 

Under these conditions the equations of motion for the auxiliary field for the Kahler modulus has 
the approximate solution 

/ \ 2-n , , 

To see how this generates a mirage pattern of masses, we look again at the ratio r = Mg/M^ 



{t + t) \ _ a+{t + F) 



3/2 ( ^jj^ ) = ^r-;r^ ^ ln(Mp,/m3/2) » 1 . (A.25) 



n 



Provided the VEVs in ()A.19p can be arranged, the mirage pattern of gaugino masses necessarily 
follows. The implied value of a follows from the definition in (|2.15|) 

a = + O (ln(m3/2/MpO) • (A.26) 

In the minimal case with n = we therefore have the prediction that a ~ 1 for this class of theories. 
We note that in the case n = we can rewrite the quantity r in ()A.25P in the following way 

r = a+(Ret) = a+J- ^ = _^ . (A.27) 

Our two classes of theories are very different, yet they both result in a mirage pattern of gaugino 
masses in which the relative sizes of the contributions to soft supersymmetry breaking depend on 
the hidden sector gaugino condensation in a similar manner, as seen by their functional dependence 
on the parameters 0+ and/or 6+. Should we find this surprising? Perhaps not, since both aim to 
solve the same problem (namely, large negative vacuum energy) using the dynamics of a single real 
scalar field. And both methods ultimately involve adding a correction to the action for this real 
scalar of the form ()A.2ip F^ The requirement that (14otai) = in the ground state then dictates the 
necessary values for the parameters such that the ratio r = Mg/M^ dependence on the gaugino 



^^Furthermore, in both constructions there are elements of this addition (|A.2ip that are not under full calculational 
control. 
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condensate is as in ()A.27p . We hasten to add, however, that the two models are indeed quite 
distinct in other regards. In particular they make quite different predictions for the other soft 
supersymmetry breaking parameters. For the case of the flux compactiflcations of Type IIB we 
refer the reader to the relevant hterature \38\ |62] for more details on how these additional terms 
are computed. We have chosen as a benchmark point a scenario studied in [38] in which n = so 
that a = 1. The overall scale was treated as a free parameter in [35] and we here take that scale to 
be 771,3/2 = 16.3 TeV. The precise values of the soft supersymmetry breaking parameters for both 
benchmark models are collected in Tabled] in Section [2] of the main text. 
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